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NUCLEAR FERTILITY RESTORER GENES AND 
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BACKGROUND OF THE INVENTION 
Field of the Invention 

[0001] This invention relates generally to nucleic acid sequences encoding proteins that 
restore fertility in a plant. 
Background Art 

[0002] Considerable effort is being devoted to the development of strategies to increase 
canola yields due to the importance of canola crops worldwide. One method of obtaining 
increased yields of canola involves the generation of hybrid canola plants. Due to a 
phenomenon termed "hybrid vigor", hybrid canola plants are higher yielding than non-hybrid 
canola plants (Grant, I. and Beversdorf, W., 1985, Can. JTGenet. Cytol. 27:472-478). In fact, 
manually produced hybrids can yield up to 50% more seed than either of their parental lines. 
To produce such hybrids on a large scale, however, self-pollination of at least one of the 
parents of the hybrid cross must be prevented. One means of preventing self-pollination is to 
incorporate the trait of cytoplasmic male sterility into a seed parent of the hybrid. 

[0003] Cytoplasmic male sterility (CMS) results in an inability of the plant to produce 
viable pollen. In some cases, pollen formation is blocked or aborted in a CMS plant because 
of a gene in a cytoplasmic organelle, the mitochondrion. This widespread and classic non- 
Mendelian trait results from rearrangements of the mitochondrial genome (Schnable, P.S. and 
Wise, R.P., 1998, Trends in Plant Sci., 3:175-180). Plants carrying the CMS trait are 
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incapable of self-pollination, and therefore, when a CMS line is planted alongside a male- 
fertile line, all the seed that forms on the sterile plants is a hybrid of the two parents. 
[0004] Importantly however, use of the CMS trait in a hybridization scheme produces 
seeds that are male-sterile since, in most species, the trait is inherited maternally. While the 
fertility of the resultant seeds is unimportant in some crops (i.e., vegetables), fertility must be 
restored in the crops for which pollen production is required for formation of the harvested 
products, as in the case of fruit or seed crops such as canola. In order to restore fertility to the 
hybrids, specific dominant nuclear genes termed restorers of fertility (Rf) can be introduced 
into the hybrid plants to suppress the male-sterile phenotype (Schnable, P.S. and Wise, R.P., 

1998, _Trends in Plant Sci., 3:175-180). Accordingly, the use of CMS for commercial seed 
production involves the use of three breeding lines, a male-sterile line (female parent), a 
maintainer line which is isogenic to the male-sterile line but does not contain a sterility 
inducing mitochondrial genome and a restorer line (male parent). 

[0005] A crop of particular interest herein is the oilseed crop of the species Brassica 
napus, commonly referred to as canola. A number of CMS systems have been reported in 
Brassica species. Five of the systems most commonly used for hybrid seed production are 
Polima {pot), nap, tournefortii, Kosena and Ogura {ogu). The fomi of CMS in Brassica 
napus which is currently thought to be potentially the most useful for hybrid seed production 
is the oga system. The ogu system is based on the use of a hybrid cytoplasm in which the 
male sterility determinant is derived from a radish {Raphanus sathwm) cytoplasm. Male 
sterility induced by ogu cytoplasm is more complete and more temperature stable than any of 
the other endogenous B. napus CMS systems. Analysis of the oga mitochondrial genome has 
indicated that this form of CMS is specified by a novel open reading frame (ORF), orfl 38, 
that encodes a polypeptide, ORF138 (Grelon et al., 1994, Mol. Gen. Genet. 243:540-547). 

[0006] Recently, a Brassica napus restorer-line for the ogu system became available 
(Delourme, R. et al., 1995, Proc. 9 th Int. Rapseed Cong. Cambridge, UK 1:6-8). Using this 
restorer line, it was determined that restoration of fertility resulted in a decrease of the 
ORF138 protein in stamens as compared to un-restored, ogu sterile lines (Bellaui, M. et al., 

1999, Plant Mol. Biol. 40:893-902). However, a drawback to these prior art oga restorer 
lines is that hybrids produced using these lines have elevated glucosinolate levels. An 
elevation of glucosinolate levels in plants is problematic when the plants are used in animal 
feed because this compound causes digestive problems in animals. Elevated glucosinolate 
levels are undesirable in canola plants in particular since much of their value is derived from « 
their low levels of glucosinolate compounds. 
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[0007] The elevation of glucosinolate levels results from a dominant gene that is linked to 
the radish nuclear fertility restorer gene or genes, termed Rfo in the prior art. Rfo, like the ogii 
cytoplasm, has been introduced from the radish but recombination in the radish chromosomal 
region surrounding Rfo is suppressed in B. napus (Deloumie R. et aL, 1998, Theor. Appl. 
Genet. 97:129-134). Despite considerable effort by several groups, it has not yet been 
possible to develop stable B. napus lines in which Rfo has been efficiently dissociated from 
the glucosinolate gene and do not address other deficiencies in Rfo restorer lines, and 
therefore, the system is not widely implemented. 

[00081 Accordingly, what are needed in the art are improved lines of canola that can be 
used as -restorers of fertility in hybridization systems. More particularly, it would be 
beneficial to provide restorer lines of canola containing one~or more nuclear fertility restorer 
genes from Raphanus sativum, which genes are separated from the gene or genes causing 
increased levels of glucosinolate in the resultant hybrid plants. The present invention also 
provides a method of using the fertility restorer nucleic acid to select for transgenic plant 
cells by means of its capacity to restore pollen production to cytoplasmic male sterile plants. 

SUMMARY OF THE INVENTION 

[0009] This invention fulfills in part the need to isolate a nuclear fertility restorer locus 
and genes and provide improved restorer lines for plants, and canola in particular. The 
present invention provides a Rfo restorer region that contains a genus of isolated nuclear 
fertility restorer genes. In one embodiment, the nuclear fertility restorer genus is derived 
from a radish, comprises a pentatricopeptide (PPR) motif and is able to restore fertility in a 
male-sterile plant. In another preferred embodiment, the nuclear fertility restorer genus is 
lacking genes associated with increased glucosinolate traits. In a preferred embodiment, the 
male^sferile plant comprises the ogu male sterility determinant and the nuclear fertility 
restorer gene is derived from Raphanus sativum. In a more preferred embodiment, the male- 
sterile plant is Brassica napus. 

[0010] The present invention provides a nuclear fertility restorer genus as shown in SEQ 

ED NO: 87. In a preferred embodiment, the present invention provides a nuclear fertility 

restorer genus, located within Genes 14 through 30, as shown between positions 88,073 and 

198,041 of SEQ ID NO:87. In a preferred embodiment, the nuclear fertility restorer gene is 

selected from Genes 15, 16, 17, 21, 22, 24, 26 and 27, as defined herein. In a preferred 

embodiment, the nuclear fertility restorer gene is a nucleotide sequence selected from SEQ 

ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:48 

SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. In a preferred embodiment, the nuclear 
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fertility restorer gene encodes a protein comprising an amino acid sequence of Gene product 
15, 16, 17, 21, 22, 24, 26 and 27, as defined herein. In a preferred embodiment, the nuclear 
fertility restorer gene encodes a protein comprising an amino acid sequence selected from 
SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:41 5 SEQ ID NO:43, SEQ ID 
NO:47, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. In a preferred embodiment, the 
nuclear fertility restorer gene comprises a nucleotide sequence of Gene 16 as shown in SEQ 
ID NO:32 or Gene 26 as shown in SEQ ID NO:89. In a preferred embodiment, the nuclear 
fertility restorer gene encodes a protein comprising an amino acid sequence as shown in SEQ 
ID NO:31 or SEQ ID NO:88. 

J.0011] The invention further provides an isolated plant transformation vector-comprising 
a nuclear fertility restorer gene as described below, wherein expression of the vector in a host 
plant results in the plant's increased production of viable pollen. In a preferred embodiment, 
the host cells are located in a plant stamen, or more particularly, a plant anther. 
[0012] The present invention also provides plant cells, plant parts, plant seeds and plants 
comprising the nuclear fertility restorer genes, proteins and vectors described herein. In one 
embodiment, a plant seed according to the present invention comprises a nuclear fertility 
restorer nucleic acid, and accordingly, the plant seed is true breeding for the ability to restore 
fertility in a male-sterile plant. The invention further provides an agricultural product 
produced by any of the below-described plants, plant parts or plant seeds. 
[0013] The invention additionally provides a method of producing a hybrid plant 
comprising crossing a male-sterile plant with a restorer plant, wherein the restorer plant 
contains a nuclear fertility restorer nucleic acid described herein. The present invention also 
provides a method of restoring male fertility in a plant comprising introducing a nuclear 
fertility restorer nucleic acid into a male-sterile plant. The present invention also provides a 
Tnethod of increasing the production of viable pollen in a plant, including introducing a 
nuclear fertility restorer nucleic acid into a plant. The present invention also provides 
methods of using genetic markers from the sequences described herein to determine the 
presence of a nuclear fertility restorer genus in a plant. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 . Flowers of Rfo-fertihty restored (A) and Ogura (pgu) cytoplasmic male sterile radish 

■ 

(Raphanus sativum) (B). Panel C allows direct comparison of fertility restored (left) and ogu 
CMS floral morphology. 
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Fig. 2. Genetic mapping of the radish Rfo locus with respect to B. napus (black lettering) and 
Arabidopsis BAC derived probes (underlined). Genetic distance is indicated in centimorgans 
(cM). 

Fig. 3A. Representation of radish genetic maps in the Rfo region based on genetic mapping of 
Arabidopsis BAC-derived probes (map positions are indicated by dots on the solid lines; only 
two of the mapped makers, LI 2 and L40, are labeled). The lower bar represents the position 
of the same markers on the Physical map of the Arabidopsis genome. The lines between the 
two bars indicate the relationship between the physical position of the markers in Arabidopsis 
and the genetic position in radish. Note-the duplication (rBl_and rB2) and inversion (rB2) of 
ArabidopsisHregion B in the radish genetic map. Fig. 3B. Proposed physical relationship 
between the Arabidopsis and radish genomes in the Rfo region. The inversion of markers in 
the rB2 region of radish with respect to Arabidopsis suggest that the Rfo gene can be 
approached by employing Arabidopsis-derived markers from region A moving from positions 
c to a. 

Fig. 4. BAC and cosmid contig spanning the Rfo containing region of the radish genome. 
The horizontal bars represent different the region contained in different clones; BAC clones 
are indicated by the shaded bars. The symbols on the bars (triangles, squares, circles etc.) 
indicate the position of different markers used in radish genetic mapping or clone recovery 
experiments. 

Fig. 5. Schematic representation of the annotation for the portion of the 270 kb radish 
genomic sequence containing the Rfo locus. The sizes and locations of different predicted 
genes within the region are indicated by the smaller arrows below the long arrow that depicts 
the corresponding numerical coordinates of the sequence. The arrows indicate the direction 
of transcription; for genes indicated by arrows pointing to the right, the sense strand sequence 
is given in the Appendix; for arrows pointing to the left, the sense strand sequence will be the 
reverse complement of that given in the Appendix. Sequences and genes contained in 
different pRD400 (solid lines) and pOCAlS (dashed lines) clones used for transformation are 
indicated beneath the annotation schematic. 

Figs. 6 A and 6B. Flowers of the Brassica napus Ogura CMS line used in transformation 

experiments (6A) and a TO transgenic plant transformed with the contract P2-Sh31, 

containing Gene 1 6 (6B). Note the poorly developed stamens and anthers in the CMS line 
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and the larger, normal, fertile stamens and anthers in the transgenic Gene 16 fertility restored 
plant. Note too the yellow petal color of B. napus flowers in comparison to the white petals 
of radish flowers. Flowers of the fertile plant recovered from transformation with the Bgl-5 
construct containing Gene 26 are identical in appearance to those of the Gene 16 

* 

transform ants. 

Figs. 7A and 7B. Left Panel: RT-PCR products generated using an oligo dT adapter primer 
and a primer internal to the Gene 16 open reading frame (3 5 RACE) from total floral RNA of 
restored fertile (lane 4) and ogu CMS (lane 5) radish plants. Lane 1: DNA size markers (1Kb 
DNA ladder). Lanes 2 and 3 are RT-PCR product&-from controls provided with the 3* RACE 
kit. Lane 6 is gene- specific positive control. Right panel: 5' RACE products generated from 
total RNA of restored fertile (lane 4) and ogu CMS radish (lane 5) plants using a primer 
internal to the Gene 16 open reading frame and an adaptor primer for the 5' end. Lanes 1,2, 
3 and 6 are the equivalents of those in the left panel. 

Fig. 8. Schematic depicting the structure of Gene 16 mRNA. Exons are indicated by boxes 
and intron positions and sizes by the diagonal lines connecting the exon boxes. Positions of 
the start (ATG), stop (TAA) and polyadenylation sites are numbered with reference to the 
Sequence shown in SEQ ID NO:87. 

Fig. 9 Structure of the protein encoded by Gene 16 (Gene 16p), as deduced from the sequence 
of the full length cDNA. 

Fig. 10. Comparisons of the proteins encoded by Gene 16 (Gene 16p) and Gene 15 (Gene 
15p). Identical amino acids are indicated by dark shading^_Sites at which an amino acid is 
replaced by a similar but non-identical amino acid are indicated by lighter shading. 

Fig. 11.5' RACE products generated from total RNA of restored fertile (lane 2) radish plants 

using a primer internal to the Gene 26 open reading frame and an adaptor primer for the 5 * 

end. Lane 3 is a gene-specific positive control. RT-PCR products generated using an oligo 

dT primer and a primer internal to the Gene 26 open reading frame (3' RACE) from total 

floral RNA of restored fertile (lane 5) radish plants. Lanes 1 & 8: DNA size markers (1Kb 

DNA ladder). Lane 6: an RT-PCR product for a control provided with the 5' RACE kit. 

Lanes 4 and 7 demonstrate that no product is generated when the adaptor primer for the 5*end 

or the oligo dT adaptor primer was omitted from the reactions respectively. 
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Fig. 12. Schematic depicting the structure of Gene 26 raRNA. Translation start and stop 
sites and polyadenylation site are as indicated in Fig. 8. 

Fig. 13. Structural features of the protein encoded by Gene 26 (Gene 26p). The predicted N 
terminal mitochondrial targeting presequence is enclosed in the open boxes. Shaded regions 
indicate copies of the PPR domain repeats. 

DETAILED DESCRIPTION OF THE INVENTION 

[0014] The present invention may be understood more~readily by reference to the 
following detailed description of the preferred embodiments of the invention and the 
Examples included herein. However, before the present compounds, compositions, and 
methods are disclosed and described, it is to be understood that this invention is not limited to 
specific nucleic acids, specific polypeptides, specific cell types, specific host cells, specific 
conditions, or specific methods, etc., as such may, of course, vary, and the numerous 
modifications and variations therein will be apparent to those skilled in the art. It is also to be 
understood that the terminology used herein is for the purpose of describing specific 
embodiments only and is not intended to be limiting. 

[0015] This invention fulfills in part the need to isolate a nuclear fertility restorer locus 
and genes and provide improved restorer lines for plants, and canola in particular. The 
present invention provides a Rfo restorer region that contains a genus of isolated nuclear 
fertility restorer genes. In one embodiment, the nuclear fertility restorer genus is derived 
from a radish, comprises pentatricopeptide (PPR) motifs and is able to restore fertility in a 
male-sterile plant. In another preferred embodiment, the nuclear fertility restorer genus" is 
lacking genes associated with increased glucosinolate traits. In a preferred embodiment, the 
male-sterile plant comprises the ogu male sterility determinant and the nuclear fertility 
restorer gene is derived from Raphanus sativum. In a more preferred embodiment, the male- 
sterile plant is Brassica napus. 

[0016] The present invention provides a nuclear fertility restorer genus as shown in SEQ 

ID NO:87. In a preferred embodiment, the present invention provides a nuclear fertility 

restorer genus, located within Genes 14 through 30 as defined herein, as shown between 

positions 88,073 and 198,041 of SEQ ID NO:87. In a preferred embodiment, the nuclear 

fertility restorer gene is selected from Genes 15, 16, 17, 21, 22, 24, 26 and 27, as defined 

herein. In a preferred embodiment, the nuclear fertility restorer gene is a nucleotide sequence 

7 



WO 03/006622 PCT/US02/22217 

selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ ID 
NO:44, SEQ ID NO:48, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. In a preferred 
embodiment, the nuclear fertility restorer gene encodes a protein comprising an amino acid 
sequence of Gene product 15, 16, 17, 21, 22, 24, 26 and 27, as defined herein. In a preferred 
embodiment, the nuclear fertility restdrer gene encodes a protein comprising an amino acid 
sequence selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:41, 
SEQ ED NO:43, SEQ ID NO:47, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. In a 
preferred embodiment, the nuclear fertility restorer gene encodes a protein comprising an 
amino acid sequence as shown in SEQ ID NO:31 or SEQ ID NO:88. 

J0017] The invention further provides-an isolated plant transformation vector comprising 
a nuclear fertility restorer gene as described below, wherein expression of the vector in a host 
plant results in the plant's increased production of viable pollen. In a preferred embodiment, 
the host cells are located in a plant stamen, or more particularly, a plant anther. 

c 

[0018] The present invention also provides plant cells, plant parts, plant seeds and plants 
comprising the nuclear fertility restorer genes, proteins and vectors described herein. In one 
embodiment, a plant seed according to the present invention comprises a nuclear fertility 
restorer nucleic acid, and accordingly, the plant seed is true breeding for the ability to restore 
fertility in a male-sterile plant. The invention further provides an agricultural product 
produced by any of the below-described plants, plant parts or plant seeds. 

[0019] The invention additionally provides a method of producing a hybrid plant 
comprising crossing a male-sterile plant with a restorer plant, wherein the restorer plant 
contains a nuclear fertility restorer nucleic acid described herein. The present invention also 
provides a method of restoring male fertility in a plant comprising introducing a nuclear 
fertility restorer nucleic acid into a male-sterile plant. The present invention also provides a 
method of increasing the production of viable pollen in a plant, including introducing a 
nuclear fertility restorer nucleic acid into a plant. The present invention also provides 
methods of using genetic markers from the sequences described herein to determine the 
presence of a nuclear fertility restorer genus in a plant. 

[0020] A novel discovery described herein comprises the identification of the nucleic acid 
sequence that encodes the Rfo genetic locus in Raphanns sathntm associated with restoration 
of fertility in male-sterile plants. It is to be understood that the Rfo genetic locus includes one 
or more introns, one or more exons, or a combination thereof. 

[0021] The present invention provides in a preferred embodiment specific genes from the 
genomic DNA of a radish Ogura restorer line. Each of these genes can, individually, 
completely restore male fertility to CMS lines carrying the Ogura cytoplasm. The genes each 
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encode apparently unrelated proteins. The availability of the isolated genes makes it possible 
to produce a restorer line by introducing the isolated gene or genes into B. napus plants for 
example by plant transformation. The resulting plants carry reduced agronomic deficiencies 
associated with the presence of J?/o-region radish DNA in B. napus, including the gene that 
elevates seed glucosinolate content. This method for producing B. napus restorer lines for 
ogu CMS is faster and less costly than any other currently known practice. 
[0022] While the genes of the present invention that restore ogu CMS bear no 
resemblance to the other restorer genes that have been characterized at the DNA or protein 
sequence levels, the invention provides that nuclear restorer genes for other CMS systems in 
other crops can be homo logs of the radish -Rfo genes. Thus, the knowledge of the molecular 
identity of fti€~Rfo genes presented herein allows the facile isolation of nuclear restorers for 
many other crop species in which CMS is employed in hybrid production, such as sunflower 
and rice. 

[0023] The examples herein provide evidence that multiple specific genes within the 
sequence listed in the Appendix as SEQ ID NO:87 can function as fertility restorer genes. 
Specifically, Gene 16 (SEQ ID NO:32) and Gene 26 (SEQ ID NO:89) are demonstrated to 
restore fertility in a CMS system. The invention provides that other genes within the region 
can also function as restorer genes. For example, two partially fertile plants were obtained 
after transformation with the Gene 15 (SEQ ID NO:30) construct. Thus Gene 15, like Gene 
1 6, can function as a restorer gene. Similarly, two partially fertile plants were recovered after 
transformation with the Gene 17 (SEQ ID NO:34) construct. A plant transformed with a 
construct containing Gene 21 (SEQ ID NO:42) and Gene 22 (SEQ ID NO:44) also produced 
some fertile flowers. Gene 24 (SEQ ID NO:48) and Gene 27 (SEQ ID NO:54) also contain 
PPR domains and are therefore expected to restore fertility phenotype according to the 
present invention. As discussed aboveTthe preliminary analysis indicated that the restorer 
functions are most preferably located between Gene 14 and Gene 30 within the Rfo region as 
shown in the Appendix between positions 88,073 and 198,041 of SEQ ID NO:87. 
[0024] There are several reasons why not all of the transformants recovered following 
transformation with a specific construct necessarily show the same phenotype. The 
expression of the genes encoded in a specific construct may vary depending on the site of 
insertion, the number of copies of the gene at each insertion site, and other factors such as 
transgene silencing. Therefore, it is apparent that other genes in the genetically defined Rfo 
containing region will likely be found by routine analysis in view of the present disclosure to 
function as genes that confer complete fertility restoration. Given that Rfo maps as a single 
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genetic locus in radish, the invention provides that two or more different genes in the region 
are able to function as restorer genes for Ogura CMS in B. napus. 

[0025] The sequences given in the Appendix have utility in the development of effective, 
glucosinolate-free or glucosinolate-reduced restorer lines through approaches other than the 
transgenic approach illustrated above. In one envisioned example of this embodiment, any 
portion of the sequence can be used to design DNA markers that allow the radish Rfo region 
to be specifically detected in plant breeding experiments aimed a separating the high seed 
glucosinolate trait from the restorer gene. Such markers allow the identification of individual 
plants that contain the restorer gene region but have a minimal amount of associated radish 
DNA. This in turn aids significantly in the development of alternative low glucosinolate 
Ogura restorer lines. 

[0026] As mentioned above, the present invention provides a genus of isolated Rfo genes 
and proteins encoded thereby. Some members include a pentatricopeptide (PPR) motif, or 
consensus sequence, and are able to restore fertility in a male-sterile plant. As used herein, 
"PPR motif includes the following amino acid consensus sequence: 
VTYNTLISGYCKAGKLEEALELFKEMKEKGIKPDV (SEQ ID NO:90). A Rfo protein is 
defined herein as a protein whose amino acid sequence has one or more 35 amino acid 
regions having greater than 50% homology with a PPR motif or consensus sequence, and 
preferably comprises at least the single methionine and the tyrosine at position three of the 
motif As also used herein, the term "male-sterile plant" refers to a plant wherein the 
function of the male organs is disrupted or blocked, or more particularly, wherein the male 
organs are unable to produce viable pollen. The term "male-sterile" includes genie male 
sterility and cytoplasmic-genetic male sterility. The terms "cytoplasmic-genetic male sterile" 
and "cytoplasmic male sterile" are used interchangeably herein. In a preferred embodiment, 
the nucleic acids and proteins 6T the present invention are used to restore fertility in a 
cytoplasmic male-sterile, or CMS, plant. In a further preferred embodiment, the cytoplasmic 
male-sterile plant comprises a cytoplasmic male sterility determinant selected from the group 
consisting of pol, nap, Tournefortii, Kosena and ogu. In a most preferred embodiment, the 
cytoplasmic male sterility determinant is ogu. 

[0027] The male-sterile plants of the present invention can be selected from maize, wheat, 
rye, oat, triticale, rice, barley, soybean, peanut, rapeseed, canola, pepper, sunflower, Vicia 
species, pea, alfalfa, bushy plants (coffee, cacao, tea), Salix species, radish, sorghum, pearl 
millet, cotton, and tobacco. It is preferable however that the male-sterile plant is a canola 
plant selected from the group of Brassica species consisting of Brassica napus, Brassica rapa 
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(or campestris), Brassica oleracea, Brassica nigra, Brassica juncea, Sinapis alba, and 
Brassica carinata. In a more preferred embodiment, the male-sterile plant is Brassica napus. 
[0028] The present invention encompasses a Rfo gene derived from a species of radish, 
including but not limited to, Raphanus sativum. In one embodiment of the present invention, 
the Rfo gene is derived from Raphanus sativum. Accordingly, the present invention provides 
a Rfo gene that, upon its introduction into a male-sterile plant, is able to increase the plant's 
production of pollen and/or restore the fertility of the plant. 

[0029] The invention further includes isolated amino acid sequences encoded by the Rfo 
genes provided herein. In a preferred embodiment, the nuclear fertility restorer protein 
(hereinafter "Rf protein" or "Rf amino acid sequence") comprises an amino acid sequence as^ 
shown herein. The present invention also includes homologs, orthologs and paralogs of the 
amino acid sequences shown herein. Homologs, orthologs and paralogs are further defined 

below. 

[0030] The terms "gene", "nucleic acid" and "nucleic acid molecule" are used 
interchangeably herein and are intended to include DNA molecules (e.g., cDNA or genomic 
DNA) and RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using 
nucleotide analogs. This term also encompasses untranslated sequence located at both the 3' 
and 5' ends of the coding region of the gene: up to at least about 1000 nucleotides of 
sequence upstream from the 5' end of the coding region and up to at least about 200 
nucleotides of sequence downstream from the 3' end of the coding region of the gene. The 
nucleic acid molecule can be single-stranded or double-stranded, but preferably is double- 

T 

stranded DNA. 

[0031] An "isolated" nucleic acid molecule is one that is substantially separated from 
other nucleic acid molecules that are present in the natural source of the nucleic acid (i.e., 
sequences encoding other proteins). Preferably, an "isolated" nucleic acid is free of some of 
the sequences that naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' 
ends of the nucleic acid) in its naturally occurring replicon. For example, a cloned nucleic 
acid is considered isolated. In one embodiment of the present invention, a Rfo nucleic acid is 
isolated when it is separated from all or part of the glucosinolate gene, for example in 
Raphanus sativum. In other various embodiments, the isolated Rfo nucleic acid molecule can 
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences 
which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 
nucleic acid is derived (e.g., a Raphanus sativum cell). A nucleic acid is also considered 
isolated if it has been altered by human intervention, or placed in a locus or location that is 
not its natural site, or if it is introduced into a cell by Agrobacterium-mediated 
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transformation. Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, 
can be free from some of the other cellular material with which it is naturally associated, or 
culture medium when produced by recombinant techniques, or chemical precursors or other 
chemicals when chemically synthesized. 

[0032] Specifically excluded from the definition of "isolated nucleic acids" are: 
naturally-occurring chromosomes (such as chromosome spreads), artificial chromosome 
libraries, genomic libraries, and cDNA libraries that exist either as an in vitro nucleic acid 
preparations or as a transfected/transformed host cell preparation, wherein the host cells are 
either an in vitro heterogeneous preparation or plated as a heterogeneous population of single 
colonies. Also specifically excluded are the above libraries wherein a specified nucleic acid 
makes up less than 5% of the number~of nucleic acid inserts in the vector molecules. Further 
specifically excluded are whole cell genomic DNA or whole cell RNA preparations 
(including whole cell preparations that are mechanically sheared or enzymatically digested). 
Even further specifically excluded are the whole cell preparations found as either an in vitro 
preparation or as a heterogeneous mixture separated by electrophoresis wherein the nucleic 
acid of the invention has not further been separated from the heterologous nucleic acids in the 
electrophoresis medium (e.g., further separating by excising a single band from a 
heterogeneous band population in an agarose gel or nylon blot). 

[0033] A nucleic acid molecule of the present invention, or a portion thereof, can be 
isolated using standard molecular biology techniques and the sequence information provided 
herein. For example, a Rfo cDNA can be isolated from a Raphanus sativum library using all 
or a portion of the sequence herein. Moreover, a nucleic acid molecule encompassing all or a 
portion of sequence herein can be isolated by the polymerase chain reaction using 
oligonucleotide primers designed based upon this sequence. For example, mRNA can be 
isolated from radish cells (e.g., by the guanidinium-thiocyanate extraction procedure of 
Chirgwin et aL, 1979 Biochemistry 18:5294-5299) and cDNA can be prepared using reverse 
transcriptase (e.g., Moloney. MLV reverse transcriptase, available from Gibco/BRL, 
Bethesda, MD; or AMV reverse transcriptase, available from Seikagaku America, Inc., St. 
Petersburg, FL). Synthetic oligonucleotide primers for polymerase chain reaction 
amplification can be designed based upon the nucleotide sequence shown herein. A nucleic 
acid molecule of the invention can be amplified using cDNA or, alternatively, genomic DNA, 
as a template and appropriate oligonucleotide primers according to standard PCR 
amplification techniques. The nucleic acid molecule so amplified can be cloned into an 
appropriate vector and characterized by DNA sequence analysis. Furthermore, 
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oligonucleotides corresponding to a Rfo nucleotide sequence can be prepared by standard 
synthetic techniques, e.g., using an automated DNA synthesizer. 

[0034] In a preferred embodiment, an isolated nucleic acid molecule of the invention 
comprises one of the nucleotide sequences shown herein. It is to be understood that 
sequences shown herein comprise whole genomic fragments isolated from genomic DNA. 
Accordingly, SEQ ID NO:87, for example contains both coding regions and 5' and 3' 
untranslated regions that can include promoters and other regulatory sequences. 
Alternatively, the nucleic acid molecules of the present invention can comprise only the 
coding region of SEQ ID NO:87. A coding region of these sequences is indicated as an 
"ORF position". The present invention alsojncludes Rfo coding nucleic acids Jhat encode 
Rfo proteins as described herein. 

[0035] Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the coding region of the sequences shown herein, for example, a fragment which can be 
used as a probe or primer or a fragment encoding a biologically active portion of a Rfo 
protein. The nucleotide sequences determined from the cloning of the Rfo genes from 
Raphanus sativum allow for the generation of probes and primers designed for use in 
identifying and/or cloning Rfo homologs in other cell types and organisms, as well as Rfo 
homologs from other radishes and related species. 

[0036] Portions of proteins encoded by the Rfo nucleic acid molecules of the invention are 
preferably biologically active portions of one of the Rfo proteins described herein. As used 
herein, the term "biologically active portion of a Rfo protein is intended to include a portion, 
e.g., a domain/motif, of a Rfo that participates in the restoration of fertility in a cytoplasmic 
male-sterile plant. In a preferred embodiment, the biologically active portion of a Rfo protein 
comprises one or more PPR motifs as described above. To determine whether a Rfo protein, 
or a biologically active portion thereof, can restore fertility in a cytoplasmic male-sterile 
plant, a fertility analysis of a plant comprising the Rfo protein may be performed. Such 
analysis methods are well known to those skilled in the art. More specifically, nucleic acid 
fragments encoding biologically active portions of a Rfo protein can be prepared by isolating 
a portion of sequences shown herein, introducing the isolated portion of nucleic acid into a 
male-sterile plant and assessing whether male-fertility is restored. A determination as to 
whether male-fertility is restored in a plant can be made, for example, by 1) visually 
assessing an increase in the production of pollen as compared to a male-sterile plant or 2) 
determining that the plant can self-fertilize as evidenced by placing a bag over a flower on the 
plant and finding an increase of seed therein as compared to a male-sterile plant. It is to be 
understood that a male-sterile plant containing an ogu cytoplasmic male sterility determinant 

■ 
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can produce a small amount of pollen. In one embodiment of the present invention, 
restoration of fertility in a male-sterile plant is indicated by an increase in the plant's pollen 
production by at least 95%. 

[0037] Biologically active portions of a Rfo proteins are encompassed by the present 
invention and include peptides comprising amino acid sequences derived from the amino acid 
sequence of a Rfo protein, or the amino acid sequence of a protein homologous to a Rfo 
protein, which includes fewer amino acids than a full length Rfo protein or the full length 
protein which is homologous to a Rfo protein, and exhibit at least one activity of a Rfo 
protein. Typically, biologically active portions (e.g., peptides which are, for example, 5, 10, 
15, 20, 30, 35, 36, 37, 38, 39, 40,_50, 100 or more amino acids in length)-comprise a domain 
or motif with at least one activity of a Rfo protein. Moreover, other biologically active 
portions in which other regions of the protein are deleted, can be prepared by recombinant 
techniques and evaluated for one or more of the activities described herein. Preferably, the 
biologically active portions of a Rfo protein include one or more PPR domains/motifs or 
portions thereof and are able to restore fertility in a cytoplasmic male-sterile plant. 

[0038] The invention also provides Rfo chimeric or fusion proteins. As used herein, a 
Rfo "chimeric protein" or "fusion protein" comprises a Rfo polypeptide operatively linked to 
a .non-Rfo polypeptide. A Rfo polypeptide refers to a polypeptide having an amino acid 
sequence corresponding to a Rfo protein, whereas a non-Rfo polypeptide refers to a 
polypeptide having an amino acid sequence corresponding to a protein which is not 
substantially homologous to the Rfo, e.g., a protein that is different from the Rfo and is 
derived from the same or a different organism. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the Rfo polypeptide and the non-Rfo 
polypeptide are fused to each other so that both sequences fulfill the proposed function 
attributed to the sequence used. The non-Rfo polypeptide can be fused to the N-terminus or 
C-terminus of the Rfo polypeptide. For example, in one embodiment, the fusion protein is a 
GST-Rfo fusion protein in which the Rfo sequences are fused to the C-terminus of the GST 
sequences. Such fusion proteins can facilitate the purification of recombinant Rfo proteins. 
In another embodiment, the fusion protein is a Rfo protein containing a heterologous signal 
sequence at its N-terminus. 

[0039] Preferably, a Rfo chimeric or fusion protein of the invention is produced by 

standard recombinant DNA techniques. For example, DNA fragments coding for the 

different polypeptide sequences are ligated together in-frame in accordance with conventional 

techniques, for example by employing blunt-ended or stagger-ended termini for ligation, 

restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as 

14 



WO 03/006622 PCTAJS02/2221 7 

appropriate, alkaline phosphatase treatment to avoid undesirable joining and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene 
fragments can be carried out using anchor primers that give rise to complementary overhangs 
between two consecutive gene fragments which can subsequently be annealed and re- 
amplified to generate a chimeric gene sequence (see, for example, Current Protocols in 
Molecular Biology, Eds. Ausubel et al. John Wiley & Sons: 1992). Moreover, many 
expression vectors are commercially available that already encode a fusion moiety (e.g., a 
GST polypeptide). A Rfo encoding nucleic acid can be cloned into such an expression vector 

— such that the fusion moiety is linked in-frame to the Rfo protein. 

[0040] In addition to fragments and fusion proteins of the Rfo proteins described herein, 
the present invention includes homologs and analogs of naturally occurring Rfo proteins and 
Rfo encoding nucleic acids in a plant. "Homologs" are defined herein as two nucleic acids or 
proteins that have similar, or "homologous", nucleotide or amino acid sequences, 
respectively. Homologs include allelic variants, orthologs, paralogs, agonists and antagonists 
of Rfos as defined hereafter. The term "homolog" further encompasses nucleic acid 
molecules that differ from the nucleotide sequence shown herein (and portions thereof) due to 
degeneracy of the genetic code and thus encode the same Rfo protein as that encoded by the 
nucleotide sequences shown herein. As used herein a "naturally occurring" Rfo protein refers 
to a Rfo amino acid sequence that occurs in nature. 

[0041] Nucleic acid molecules corresponding to natural homologs such as allelic variants, 
orthologs and paralogs and natural analogs of a Rfo cDNA can be isolated based on their 
identity to the Raphanus sativum Rfo nucleic acids described herein. These natural homologs 
and analogs can be isolated using Rfo cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. In 
an alternative embodiment, homologs of the Rfo protein can be identified by screening 
combinatorial libraries of mutants, e.g., truncation mutants, of the Rfo nucleic acids for Rfo 
protein agonist or antagonist activity. In one embodiment, a variegated library of Rfo variants 
is generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a 
variegated gene library. A variegated library of Rfo variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences 
such that a degenerate set of potential Rfo sequences is expressible as individual 
polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage display) 
containing the set of Rfo sequences therein. There are a variety of methods that can be used 
to produce libraries of potential Rfo homologs from a degenerate oligonucleotide sequence. 
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Chemical synthesis of a degenerate gene sequence can be performed in an automatic DNA 
synthesizer, and the synthetic gene is then ligated into an appropriate expression vector. Use 
of a degenerate set of genes allows for the provision, in one mixture, of all of the sequences 
encoding the desired set of potential Rfo sequences. Methods for synthesizing degenerate 
oligonucleotides are known in the art (see, e.g., Narang, S.A., 1983 Tetrahedron 39:3; Itakura 
et al., 1984 Annu. Rev. Biochem. 53:323; Itakura et al., 1984 Science 198:1056; Ike et al., 
1983 Nucleic Acid Res. 1 1 All). 

[0042] In addition, libraries of fragments of the Rfo coding regions can be used to 
generate a variegated population of Rfo fragments for screening and subsequent selection of 
homologs of a Rfo. Intone embodiment, a library of coding sequence fragments can be 
generated by treating a double stranded PCR fragment of a Rfo coding sequence with a 
nuclease under conditions wherein nicking occurs only about once per molecule, denaturing 
the double stranded DNA, renaturing the DNA to form double stranded DNA, which can 
include sense/antisense pairs from different nicked products, removing single stranded 
portions from reformed duplexes by treatment with SI nuclease, and ligating the resulting 
fragment library into an expression vector. By this method, an expression library can be 
derived which encodes N-terminal, C-terminal and internal fragments of various sizes of the 
Rfo proteins. 

[0043] Several techniques are known in the art for screening gene products of 

combinatorial libraries made by point mutations or truncation, and for screening cDNA 

libraries for gene products having a selected property. Such techniques are adaptable for 

rapid screening of the gene libraries generated by the combinatorial mutagenesis of Rfo 

homologs. The most widely used techniques, which are amenable to high through-put 

analysis, for screening large gene libraries typically include cloning the gene library into 

replicable expression vectors, transforming appropriate cells with the resulting library of 

vectors, and expressing the combinatorial genes under conditions in which detection of a 

desired activity facilitates isolation of the vector encoding the gene whose product was 

detected. Recursive ensemble mutagenesis (REM), a new technique that enhances the 

frequency of functional mutants in the libraries, can be used in combination with the 

screening assays to identify Rfo homologs (Arkin and Yourvan, 1992 PNAS 89:7811-7815; 

Delgrave et al., 1993 Protein Engineering 6(3):327-331). In another embodiment, cell based 

assays can be exploited to analyze a variegated Rfo library, using methods well known in the 

art. The present invention further provides a method of identifying a novel Rfo protein, 

comprising (a) raising a specific antibody response to a Rfo protein, or a fragment thereof, as 

described above; (b) screening putative Rfo protein material with the antibody, wherein 
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specific binding of the antibody to the material indicates the presence of a potentially novel 
Rfo protein; and (c) analyzing the bound material in comparison to known Rfo proteins, to 
determine its novelty. 

[0044] Preferably, the above described Rfo homologs retain the same biological activity 
as the Rfo proteins shown herein, and more preferably, the Rfo homologs restore fertility in a 
cytoplasmic male-sterile plant. To determine the percent homology of two amino acid 
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of one protein for optimal alignment with the other protein). The 
amino acid residues at corresponding amino acid positions are then compared. When a 
position in_one~sequence is occupied by the same amino acid residue as the corresponding — 
position in the other sequence, then the molecules are homologous lit that position (i.e., as 
used herein amino acid or nucleic acid "homology" is equivalent to amino acid or nucleic 
acid "identity"). The same type of comparison can be made between two nucleic acid 
sequences. 

[0045] The percent homology between the two sequences is a function of the number of 
identical positions shared by the sequences (i.e., % homology = numbers of identical 
positions/total numbers of positions x 100). Preferably, the isolated Rfo protein homologs 
included in the present invention are at least about 50-60%, preferably at least about 60-70%, 
and more preferably at least about 70-80%, 80-90%, 90-95%, and most preferably at least 
about 96%, 97%, 98%, 99% or more homologous to an entire amino acid sequence shown in 
herein. In yet another embodiment, the isolated Rfo protein homologs included in the present 
invention are at least about 50-60%, preferably at least about 60-70%, and more preferably at 
least about 70-80%, 80-90%, 90-95%, and most preferably at least about 96%, 97%, 98%, 
99% or more homologous to an entire amino acid sequence encoded by a nucleic acid 
sequence shown herein^ In other embodiments, the isolated Rfo protein homologs have ~ 
homology over at least 15 contiguous amino acid residues, more preferably at least 25 
contiguous amino acid residues, and most preferably at least 35 contiguous amino acid 
residues of the sequences shown herein. In a further preferred embodiment, the Rfoo 
homologs have greater than 90% homology over the PPR motif 

[0046] In another preferred embodiment, an isolated Rfo nucleic acid homolog of the 
invention comprises a nucleotide sequence which is at least about 50-60%, preferably at least 
about 60-70%, more preferably at least about 70-80%, 80-90%, or 90-95%, and even more 
preferably at least about 95%, 96%, 97%, 98%, 99% or more homologous to a nucleotide 
sequence shown herein, or a portion thereof The preferable length of sequence comparison 
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for nucleic acids is at least 75 nucleotides, more preferably at least 100 nucleotides and most 
preferably the entire coding region of the nucleic acid. 

[0047] With regard to the present invention, a determination of the percent homology 
between two sequences is accomplished using a mathematical algorithm. In a preferred 
embodiment of the present invention, the percent homology between two sequences is 
determined using the mathematical algorithm of Karlin and Altschul (1990 Proc. Natl. Acad. 
Sci. USA 90:5873-5877). Such an algorithm is incorporated into the NBLAST and XBLAST 
programs of Altschul, et al. (1990 J. Mol. Biol. 215:403-410). Accordingly, the present 
invention includes a Rfo nucleic acid homolog having at least 50% homology with the 
nucleotide sequence shown herein as-determined using the NBLAST program,_score=100, 
wordlength=12. Additionally, the present invention-includes a Rfo amino acid homolog 
having at least 70% homology with the amino acid sequence shown herein as determined 
using the XBLAST program, score=50, wordlength=3 . When BLAST programs are used to 
determine percent homology, Gapped BLAST is utilized as described in Altschul et al. (1997 
Nucleic Acids Res. 25:3389-3402). When utilizing BLAST and Gapped BLAST programs, 
the default parameters of the respective programs (e.g., XBLAST and NBLAST) are used. 

[0048] In another embodiment of the present invention, the percent homology between 
two sequences is determined using the mathematical algorithm of Smith and Waterman. In 
yet another embodiment, the percent homology between two sequences is determined using 
the mathematical algorithm of Myers and Miller (CABIOS 1989). The Myers and Miller 
algorithm is incorporated into the ALIGN program (version 2.0) that is part of the GCG 
sequence alignment software package. When utilizing the ALIGN program for comparing 
amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12 and a gap 
penalty of 4 is used to obtain Rfo amino acid homologs. 

[0049] Finally, homology between nucleic acid sequences can be determined using 
hybridization techniques known to those of skill in the art. Accordingly, an isolated Rfo 
nucleic acid molecule of the invention comprises a nucleotide sequence which hybridizes, 
e.g., hybridizes under stringent conditions, to the nucleotide sequence shown herein or a 
portion thereof. More particularly, an isolated nucleic acid molecule of the invention is at 
least 15 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising a nucleotide sequence of herein. In other embodiments, the nucleic acid 
is at least 30, 50, 100, 250 or more nucleotides in length. Preferably, an isolated nucleic acid 
homolog of the invention comprises a nucleotide sequence which hybridizes under highly 
stringent conditions to the nucleotide sequence shown herein and restores fertility when 
expressed in a cytoplasmic male- sterile plant. 
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[0050] As used herein with regard to hybridization, the term "stringent conditions'* refers 
to 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes 
in 0.2 to 0.5 X SSC, 0.1 to 0.5% SDS at 50 to 68°C. Additionally, the term "highly stringent 
conditions" refers to 6X SSC at about 45°C, followed by one or more washes in 0.5 X SSC, 
0.5% SDS at 68°C. Preferably, an isolated nucleic acid molecule of the invention that 
hybridizes under stringent or highly stringent conditions to a sequence herein corresponds to 
a naturally occurring nucleic acid molecule. As used herein, a "naturally occurring" nucleic 
acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs 
in nature (e.g., encodes a natural protein). In one embodiment, the nucleic acid encodes a 
naturally occurring Raphanus sativum RfoT protein. 

[0051] Using the above-described methods, and others known to those of skill in the art, 
one of ordinary skill in the art can isolate homologs of the Rfo nucleic acids comprising a 
nucleotide sequence shown in SEQ ID NO: 1 and Rfo proteins comprising an amino acid 
sequence shown in SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:6. One subset of these 
homologs comprises allelic variants. As used herein, the term "allelic variant" refers to a 
nucleotide sequence containing polymorphisms that lead to changes in the amino acid 
sequences of a Rfo protein and that exist within a natural population (e.g., a plant species or 
variety). Such natural allelic variations can typically result in 1-5% variance in a Rfo nucleic 
acid. Allelic variants can be identified by sequencing the nucleic acid sequence of interest in 
a number of different radish plants, which can be readily carried out by using hybridization 
probes to identify the same Rfo genetic locus in those radish plants. Any and all such nucleic 
acid variations and resulting amino acid polymorphisms or variations in a Rfo protein that are 
the result of natural allelic variation and that do not alter the functional activity of a Rfo 
protein, are intended to be within the scope of the invention. 

[0052] Moreover, nucleic acid molecules encoding Rfo proteins from the same or other 
species such as Rfo analogs, orthologs and paralogs, are intended to be within the scope of the 
present invention. As used herein, the term "analogs" refers to two nucleic acids that have 
the same or similar function, but that have evolved separately in unrelated organisms. As 
used herein, the term "orthologs" refers to two nucleic acids from different species, but that 
have evolved from a common ancestral gene by speciation. Normally, orthologs encode 
proteins having the same or similar functions. As also used herein, the term "paralogs" refers 
to two nucleic acids that are related by duplication within a genome. Paralogs usually have 
different functions, but these functions may be related (Tatusov, R.L. et al. 1997 Science 
278(5338):63 1-637). 
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[0053] Analogs, orthologs and paralogs of a naturally occurring Rfo nucleic acids can 
encode proteins that differ from a naturally occurring Rfo protein by post-translational 
modifications, by amino acid sequence differences, or by both. Post-translational 
modifications include in vivo and in vitro chemical derivatization of polypeptides, e.g., 
acetylation, carboxylation, phosphorylation, or glycosylation, and such modifications may 
occur during polypeptide synthesis or processing or following treatment with isolated 
modifying enzymes. In particular, orthologs of the invention will generally exhibit at least 
80-85%, more preferably 90%, and most preferably 95%, 96%, 97%, 98% or even 99% 
identity or homology with all or part of a naturally occurring Rfo amino acid sequence and 
will exhibit a function similar_to a Rfo protein. Preferably, a Rfo ortholog of the present 
invention restores fertility in a cytoplasmic male-sterile plant; More preferably^ a Rfo 
ortholog restores fertility in a cytoplasmic male-sterile Brassica napus plant. 

[0054] In addition to naturally occurring variants of a Rfo sequence that may exist in the 
population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into a nucleotide sequence shown herein, thereby leading to changes in the amino 
acid sequence of the encoded Rfo protein, without altering the functional activity of the Rfo 
protein. For example, nucleotide substitutions leading to amino acid substitutions at "non- 
essential" amino acid residues can be made in the sequences. A 4 'non-essential" amino acid 
residue is a residue that can be altered from the wild-type sequence of one of the Rfo proteins 
without altering the activity of said Rfo protein, whereas an "essential" amino acid residue is 
required for Rfo protein activity. Other amino acid residues, however, (e.g., those not within 
the PPR motif described above) may not be essential for activity and thus are likely to be 
amenable to alteration without altering Rfoo activity. 

[0055] Accordingly, an isolated nucleic acid molecule encoding a Rfo protein 
homologous to a protein sequence herein can be created by introducing one or more 
nucleotide substitutions, additions or deletions into a nucleotide sequence such that one or 
more amino acid substitutions, additions or deletions are introduced into the encoded protein. 
Mutations can be introduced into one of the sequences by standard techniques, such as site- 
directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid 
substitutions are made at one or more predicted non-essential amino acid residues. A 
"conservative amino acid substitution" is one in which the amino acid residue is replaced 
with an amino acid residue having a similar side chain. 

[0056] Families of amino acid residues having similar side chains have been defined in 
the art. These families include amino acids with basic side chains (e.g., lysine, arginine, 
histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains 
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(e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side 
chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side 
chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential 
amino acid residue in a Rfo is preferably replaced with another amino acid residue from the 
same side chain family. Alternatively, in another embodiment, mutations can be introduced 
randomly along all or part of a Rfo coding sequence, such as by saturation mutagenesis, and 
the resultant mutants can be screened for a Rfo activity described herein to identify mutants 
that retain Rfo activity. Following mutagenesis of the sequence, the encoded protein can be 
expressed and the activity of the protein can be determined by analyzing the viable pollen 
production of a plant expressing the protein as described above. For example, a Rfo mutant 
that retains activity restores fertility in a male-sterile plant upon its expression in the plant. 
[0057] Additionally, optimized Rfo nucleic acids can be created. Preferably, an optimized 
Rfo nucleic acid encodes a Rfo that restores fertility in a cytoplasmic male-sterile plant, and 
more particularly cytoplasmic male-sterile Brassica napus plant. As used herein, 
"optimized" refers to a nucleic acid that is genetically, engineered to increase its expression in 
a given plant or animal. To provide plant optimized Rfo nucleic acids, the DNA sequence of 
the gene can be modified to 1) comprise codons preferred by highly expressed plant genes; 2) 
comprise an A+T content in nucleotide base composition to that substantially found in plants; 
3) form a plant initiation sequence, 4) eliminate sequences that cause destabilization, 
inappropriate polyadenylation, degradation and termination of RNA, or that fonn secondary 
structure hairpins or RNA splice sites. Increased expression of Rfo nucleic acids in plants can 
be achieved by utilizing the distribution frequency of codon usage in plants in general or a 
particular plant such as Brassica napus. 

[0058] As used herein, "frequency of preferred codon usage" refers to the preference 
exhibited by a specific host cell in usage of nucleotide codons to specify a given amino acid. 
To determine the frequency of usage of a particular codon in a gene, the number of 
occurrences of that codon in the gene is divided by the total number of occurrences of all 
codons specifying the same amino acid in the gene. Similarly, the frequency of preferred 
codon usage exhibited by a host cell can be calculated by averaging the frequency of 
preferred codon usage in a large number of genes expressed by the host cell. It is preferable 
that this analysis be limited to genes that are highly expressed by the host cell. The percent 
deviation of the frequency of preferred codon usage for a synthetic gene from that employed 
by a host cell is calculated first by determining the percent deviation of the frequency of 
usage of a single codon from that of the host cell followed by obtaining the average deviation 
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over all codons. As defined herein, this calculation includes unique codons (i.e., ATG and 
TGG). In general terms, the overall average deviation of the codon usage of an optimized 
gene from that of a host cell is calculated using the equation lA = n = lZX n -Y n Xn times 
100 Z where X n = frequency of usage for codon n in the host cell; Y n = frequency of usage 
for codon n in the synthetic gene, n represents an individual codon that specifies an amino 
acid and the total number of codons is Z. The overall deviation of the frequency of codon 
usage, A, for all amino acids should preferably be less than about 25%, and more preferably 
less than about 10%. 

[0059] Hence, a Rfo nucleic acid can be optimized such that its distribution frequency of 
codon usage deviates, preferably, no more than 25% from that of highly expressed plant 
genes and, more preferably, no more than about 10%. In addition, consideration is given to 
the percentage G+C content of the degenerate third base (monocotyledons appear to favor 
G+C in this position, whereas dicotyledons do not). It is also recognized that , the XCG 
(where X is A, T, C, or G) nucleotide is the least preferred codon in dicots whereas the XTA 
codon is avoided in both monocots and dicots. Optimized Rfo nucleic acids of this invention 
also preferably have CG and TA doublet avoidance indices closely approximating those of 
the chosen host plant (i.e., Brassica napus). More preferably these indices deviate from that 
of the host by no more than about 10-15%. 

[0060] In addition to the nucleic acid molecules encoding the Rfo's described above, 
another aspect of the invention pertains to isolated nucleic acid molecules that are antisense 
thereto. An "antisense" nucleic acid comprises a nucleotide sequence that is complementary 
to a "sense" nucleic acid encoding a protein, e.g., complementary to the coding strand of a 
double-stranded cDNA molecule or complementary to an mRNA sequence. Accordingly, an 

■ 

antisense nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic acid 
can be complementary to an entire Rfo coding strand, or to only a portion thereof. In one_ 
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the 
coding strand of a nucleotide sequence encoding a Rfo. The term "coding region" refers to 
the region of the nucleotide sequence comprising codons that are translated into amino acid 
residues (e.g., the entire coding region of „,„ comprises nucleotides 1 to ....). In another 
embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" of the 
coding strand of a nucleotide sequence encoding a Rfo. The term "noncoding region" refers 
to 5' and 3' sequences that flank the coding region that are not translated into amino acids 
(i.e., also referred to as 5' and 3* untranslated regions). 

[0061] In a preferred embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule which is a complement of the nucleotide sequence shown 
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herein, or a portion thereof- A nucleic acid molecule that is complementary to the nucleotide 
sequence shown herein is one which is sufficiently complementary to the nucleotide sequence 
shown such that it can hybridize to the nucleotide sequence shown, thereby forming a stable 
duplex. 

[0062] Given the coding strand sequences encoding the Rfo's disclosed herein, antisense 
nucleic acids of the invention can be designed according to the rules of Watson and Crick 
base pairing. The antisense nucleic acid molecule can be complementary to the entire coding 
region of Rfo mRNA, but more preferably is an oligonucleotide which is antisense to only a 
portion of the coding or noncoding region of Rfo mRNA. For example, the antisense 
oligonucleotide can be complementary-to-the region surrounding the translation' start site of 
Rfo mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 
35, 40, 45 or 50 or more nucleotides in length. 

[0063] An antisense nucleic acid of the invention can be constructed using chemical 
synthesis and enzymatic ligation reactions using procedures known in the art. For example, 
an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously modified nucleotides designed to increase 
the biological stability of the molecules or to increase the physical stability of the duplex 
formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and 
acridine substituted nucleotides can be used. Examples of modified nucleotides which can be 
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, - 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6- 
isopentenyladenine, 1 -methylguanine, 1-methylinosine, 2, 2-dimethyl guanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5 -methy laminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, 5 '-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
i sop en tenyl adenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, unacil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3- 
N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense 
nucleic acid can be produced biologically using an expression vector into which a nucleic 
acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted 
nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described 
further in the following subsection). 
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[0064] The antisense nucleic acid molecules of the invention are typically administered to 
a cell or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a Rfo to thereby inhibit expression of the protein, e.g., by inhibiting 
transcription and/or translation. The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic 
acid molecule which binds to DNA duplexes, through specific interactions in the major 
groove of the double helix. The antisense molecule can be modified such that it specifically 
binds to a receptor or an antigen expressed on a selected cell surface, e.g., by linking the 
antisense nucleic acid molecule to a peptide or an antibody which binds to a cell surface 
receptor or antigen. The antisense nucleic acid molecule can also be-delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations""bf the 
antisense molecules, vector constructs in which the antisense nucleic acid molecule is placed 
under the control of a strong prokaryotic, viral, or eukaryotic (including plant) promoter are 
preferred. •* ■ 

[0065] In yet another embodiment, the antisense nucleic acid molecule of the invention is 
an ct-anomeric nucleic acid molecule. An oc-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual P-units, 
the strands run parallel to each other (Gaultier et al., 1987 Nucleic Acids. Res. 15:6625- 
6641). The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide 
(Inoue et al., 1987 Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue 
(Inoue et al., 1987 FEBS Lett. 215:327-330). 

[0066] In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes described in 
Haselhoff and Gerlach, 1988 Nature 334:585-591) can be used to catalytically cleave Rfo 
mRNA transcripts to thereby inhibit translation of Rfo mRNA. A ribozyme having 
specificity for a Rfo-encoding nucleic acid can be designed based upon the nucleotide 
sequence of a Rfo cDNA, corresponding to an ORF of a Rfo nucleic acid provided herein or 
on the basis of a heterologous sequence to be isolated according to methods taught in this 
invention. For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide sequence 
to be cleaved in a Rfo-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071 and 
Gech et al. U.S. Patent No. 5,116,742. Alternatively, Rfo mRNA can be used to select a 
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catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, 
e.g., Bartel, D. and Szostak, J.W., 1993 Science 261:1411-1418. 

[0067] Alternatively, Rfo gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of a Rfo nucleotide sequence (e.g., a Rfo 
promoter and/or enhancer) to form triple helical structures that prevent transcription of a Rfo 
gene in target cells. See generally, Helene, C, 1991 Anticancer Drug Des. 6(6):569-84; 
Helene, C. et al., 1992 Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L.J., 1992 Bioassays 
14(12):807-15. 

[0068] In addition to the Rfo nucleic acids and proteins described above, the present 
invention encompasses these-nueleic acids and proteins attached to a moiety. These moieties 
include, but are not limited to, detection "moieties, hybridization moieties, purification 
moieties, delivery moieties, reaction moieties, binding moieties, and the like. A typical group 
of nucleic acids having moieties attached includes probes and primers. Probes and primers 
typically comprise a substantially isolated oligonucleotide. The oligonucleotide typically 
comprises a region of nucleotide sequence that hybridizes under stringent conditions to at 
least about 12, preferably about 25, more preferably about 40, 50 or 75 consecutive 
nucleotides of a sense strand of the sequence set forth in the sequences, an anti-sense 
sequence of the sequence set forth in the sequences, or naturally occurring mutants thereof. 
Primers based on a nucleotide sequences herein can be used in PCR reactions to clone Rfo 
homologs. Probes based on the Rfo nucleotide sequences can be used to detect transcripts or 
genomic sequences encoding the same or homologous proteins. In preferred embodiments, 
the probe further comprises a label group such as a radioisotope, a fluorescent compound, an 
enzyme or an enzyme co-factor. Such probes can be used as a part of a genomic marker test 
kit for identifying cells which express a Rfo nucleic acid, such as by measuring a level of a 
/^-encoding nucleic acid, in a sample of cells, e.g., detecting Rfo mRNA levels or 
determining whether a genomic Rfo gene has been mutated or deleted. 

[0069] The invention further provides an isolated recombinant expression vector 

comprising a Rfo nucleic acid as described above, wherein expression of the vector in a host 

plant results in increased produced of viable pollen. As used herein, the term "vector" refers 

to a nucleic acid molecule capable of transporting another nucleic acid to which it has been 

linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA 

loop into which additional DNA segments can be ligated. Another type of vector is a viral 

vector, wherein additional DNA segments can be ligated into the viral genome. Certain 

vectors are capable of autonomous replication in a host cell into which they are introduced 

(e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian 
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vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host cell, and thereby are replicated along 
with the host genome. Moreover, certain vectors are capable of directing the expression of 
genes to which they are operatively linked. Such vectors are referred to herein as "expression 
vectors''. In general, expression vectors of utility in recombinant DNA techniques are often 
in the form of plasmids. In the present specification, "plasmid" and "vector" can be used 
interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression vectors, such as viral vectors 
(e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which 
serve equivalent functions. — 

[0070] The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which means 
that the recombinant expression vectors include one or more regulatory sequences, selected 
on the basis of the host cells to be used for expression, which is operatively linked to the 
nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably 
linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory 
sequence(s) in a manner which allows for expression of the nucleotide sequence (e.g., in an in 
vitro transcription/ translation system or in a host cell when the vector is introduced into the 
host cell). The term "regulatory sequence" is intended to include promoters, enhancers and 
other expression control elements (e.g., polyadenylation signals). Such regulatory sequences 
are described, for example, in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990) or see: Gruber and Crosby, in: 
Methods in Plant Molecular Biology and Biotechnology, eds. Glick and Thompson, Chapter 
7, 89-108, CRC Press: Boca Raton, Florida, including the references therein. Regulatory 
sequences include those that direct constitutive expression of a~nucleotide sequence in many 
types of host cells and those that direct expression of the nucleotide sequence only in certain 
host cells or under certain conditions. It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the 
invention can be introduced into host cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., Rib proteins, 
mutant forms of Rfo proteins, fusion proteins, etc.). 

[0071] The recombinant expression vectors of the invention can be designed for 
expression of Rfo's in prokaryotic or eukaryotic cells. For example, Rfo genes can be 
expressed in multicellular plant cells (see Schmidt, R. and Willmitzer, L., 1988 High 
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efficiency Agj-obacterium tumefaciens-mediated transformation of Arabidopsis thaliana leaf 
and cotyledon explants, Plant Cell Rep. 583-586); Plant Molecular Biology and 
Biotechnology, C Press, Boca Raton, Florida, chapter 6/7, S.71-119 (1993); F.F. White, B. 
Jenes et al., Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1, Engineering and 
Utilization, eds. Kung und R. Wu, 128-43, Academic Press: 1993; Potrykus, 1991 Annu. 
Rev. Plant Physiol. Plant Molec. Biol. 42:205-225 and references cited therein); C 
glutamicum, insect cells (using baculovirus expression vectors), yeast and other fungal cells 
(see Romanos, M.A. et al., 1992 Foreign gene expression in yeast: a review, Yeast 8:423- 
488; van den Hondel, C.A.MJJ. et al., 1991 Heterologous gene expression in filamentous 
fungi, in: More Gene Manipulations in Fungi, J.W. Bennet & L.L. Lasure, eds., p. 396-428: 
Academic Press: San Diego; and van den Hondel, C.A.MJJ. & Punt, P.J., 1991 Gene 
transfer systems and vector development for filamentous fungi, in: Applied Molecular 
Genetics of Fungi, Peberdy, J.F. et al., eds., p. 1-28, Cambridge University Press: 
Cambridge), algae (Falciatore et al., 1999 Marine Biotechnology 1(3):239-251) or 
mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press: San Diego, CA (1990). 
Alternatively, the recombinant expression vector can be transcribed and translated in vitro^ 
for example using T7 promoter regulatory sequences and T7 polymerase. 
[0072] Expression of proteins in prokaryotes is most often carried out with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded 
therein, usually to the amino terminus of the recombinant protein but also to the C-terminus 
or fused within suitable regions in the proteins. Such fusion vectors typically serve three 
purposes: 1) to increase expression of a recombinant protein; 2) to increase the solubility of a 
recombinant protein; and 3) to aid in the purification of a recombinant protein by acting as a 
ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site 
is introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to purification of 
the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor 
Xa, thrombin and enterokinase. 

[0073] Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, 

D.B. and Johnson, K.S., 1988 Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) 

and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose 

E binding protein, or protein A, respectively, to the target recombinant protein. In one 

embodiment, the coding sequence of the Rfo is cloned into a pGEX expression vector to 
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create a vector encoding a fusion protein comprising, from the N-terminus to the C-terminus, 
GST-thrombin cleavage site-X protein. The fusion protein can be purified by affinity 
chromatography using glutathione-agarose resin. Recombinant Rfo unfused to GST can be 
recovered by cleavage of the fusion protein with thrombin. 

[0074] Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amann et al., 1988 Gene 69:301-315) and pET lid (Studier et al., Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 60- 
89). Target gene expression from the pTrc vector relies on host RNA polymerase 
transcription from a hybrid trp-lac fusion promoter. Target gene expression from the pET 
lid vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a co- — 
expressed viral RNA polymerase (T7 gnl). This viral polymerase is~~supplied by host strains 
BL21(DE3) or HMS174(DE3) from a resident X prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

[0075J In another embodiment, the Rfo expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSecl (Baldari, et al., 
1987 EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, 1982 Cell 30:933-943), pJRY88 
(Schultz et al., 1987 Gene 54:113-123), and pYES2 (Invitrogen Corporation, San Diego, 
CA). Vectors and methods for the construction of vectors appropriate for use in other fungi, 
such as the filamentous fungi, include those detailed in: van den Hondel, C.A.MJ.J. & Punt, 
P.J. (1991) "Gene transfer systems and vector development for filamentous fungi", in: 
Applied Molecular Genetics of Fungi, J.F. Peberdy, et al., eds., p. 1-28, Cambridge 
University Press: Cambridge. 

[0076] Alternatively, the Rfo of the invention can be expressed in insect cells using 
baculovirus expression vectors. Baculovirus vectors available for expression of proteins in 
cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et aL, 1983 Mol. Cell Biol. 
3 :2 1 56-2 1 65) and the pVL series (Lucklow and Summers, 1 989 Virology 1 70:3 1 -39). 

[0077] In another embodiment, the Rfo of the invention may be expressed in unicellular 

plant cells (such as algae) (see Falciatore et al., 1999 Marine Biotechnology 1(3):239-251 and 

references therein), and more preferably, plant cells from higher plants (e.g., the 

spermatophytes, such as crop plants). Examples of plant expression vectors include those 

detailed in: Becker, D., Kemper, E., Schell, J. and Masterson, R., 1992 New plant binary 

vectors with selectable markers located proximal to the left border, Plant Mol. Biol. 20: 1 195- 

1197; and Bevan, M.W., 1984 Binary Agrobacteriwn vectors for plant transformation, Nucl. 

Acid. Res. 12:8711-8721; Vectors for Gene Transfer in Higher Plants; in: Transgenic Plants, 

Vol. 1, Engineering and Utilization, eds.: Kung and R. Wu, Academic Press, 1993, S. 15-38. 
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[0078] A plant expression cassette preferably contains regulatory sequences capable of 
driving gene expression in plant cells and operably linked so that each sequence can fulfill its 
function, for example, termination of transcription by polyadenylation signals. Preferred 
polyadenylation signals are those originating from Agrobacteriwn tumefaciens t-DNA such 
as the gene 3 known as octopine synthase of the Ti-plasmid pTiACHS (Gielen et aL, 1984 
EMBO J. 3:835) or functional equivalents thereof but also all other terminators functionally 
active in plants are suitable. As plant gene expression is very often not limited on 
transcriptional levels, a plant expression cassette preferably contains other operably linked 
sequences like translational enhancers such as the overdrive-sequence containing the 5'- 
untranslated leader sequence from tobacco mosaic virus enhancing the protein per RNA ratio 
(Gallie et aL, 1987 NucL Acids Research 15:8693-8711). 

[0079] Plant gene expression must be operably linked to an appropriate promoter in order 
to confer gene expression in a timely, cell or tissue specific manner. Preferred are promoters 
driving constitutive expression (Benfey et aL, 1989 EMBO J. 8:2195-2202) like those derived 
from plant viruses like the 35S CAMV (Franck et aL, 1980 Cell 21:285-294), the 19S CaMV 
(see also U.S. Patent No. 5,352,605 and PCT Application No. WO 8402913) or plant 
promoters like those from Rubisco small subunit described in U.S. Patent No. 4,962,028. 

Especially preferred are those promoters that confer gene expression in specific 
plant tissues and organs, such as stamens and anthers. In this regard, a promoter which 
expresses during stamen development would be preferred as such a promoter is particularly 
appropriate to drive Rfo expression resulting in altered pollen production as desired. 
Examples of such promoters include the AP3 promoter, the Lat52 promoter (Twell, D. et al. 
(1989). Mol. Gen. Genet. 217, 240-248; Twell, D. et al. (1990). Development 109, 705-715.), 
the A9 promoter (Paul, W. et al., (1992). Plant Mol. Biol. 19, 611-622.), the fbpl promoter 
(Angenent, G.C. (1993). Plant J. 4, 101-112), the EPF2-5 promoter (Takatsuji, H. et al. 
(1994). Plant Cell 6, 947-958), and the pfii4 promoter (Christensen, H.E. et al. (1996). Plant 
J. 10, 269-279). However, the utility of the present methods are not restricted with respect to 
the promoter. As will be appreciated by one of skill in the art, constitutive promoters and 
promoters which express during other stages of plant development, for example prior to 
stamen development, may also be useful in the present methods. 

[0080] Plant gene expression can also be facilitated via an inducible promoter (for review, 
see Gatz, 1997 Annu. Rev. Plant Physiol. Plant Mol. Biol. 48:89-108). Chemically inducible 
promoters are especially suitable if gene expression is wanted to occur in a time specific 
manner. Examples of such promoters are a salicylic acid inducible promoter (PCT 
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Application No. WO 95/19443), a tetracycline inducible promoter (Gatz et al., 1992 Plant J. 
2:397-404) and an ethanol inducible promoter (PCT Application No. WO 93/21334). 
[0081] Other preferred sequences for use in plant gene expression cassettes are targeting- 
sequences necessary to direct the gene product in its appropriate cell compartment (for 
review see Kermode, 1996 Grit. Rev. Plant Sci. 15(4):285-423 and references cited therein) 
such as the vacuole, the nucleus, all types of plastids like amyloplasts, chloroplasts, 
chromoplasts, the extracellular space, mitochondria, the endoplasmic reticulum, oil bodies, 
peroxisomes and other compartments of plant cells. 

[0092] In addition to providing a recombinant expression vector comprising a Rfo DNA 
molecule of the invention cloned into the expression vector injasense orientation, the present 
invention providessuch a vector wherein the Rfo DNA molecule is cloned into the vector in 
the antisense orientation. That is, the DNA molecule is operatively linked to a regulatory 
sequence in a manner that allows for expression (by transcription of the DNA molecule) of an 
RNA molecule that is antisense to a Rfo mRNA. Regulatory sequences operatively linked to 
a nucleic acid molecule cloned in the antisense orientation can be chosen which direct the 
continuous expression of the antisense RNA molecule in a variety of cell types. For instance, 
viral promoters and/or enhancers, or regulatory sequences can be chosen which direct 
constitutive, tissue specific or cell type specific expression of antisense RNA. The antisense 
expression vector can be in the form of a recombinant plasmid, phagemid or attenuated virus 
wherein antisense nucleic acids are produced under the control of a high efficiency regulatory 
region. The activity of the regulatory region can be determined by the cell type into which 
the vector is introduced. For a discussion of the regulation of gene expression using antisense 
genes see Weintraub, H. et al., Antisense RNA as a molecular tool for genetic analysis, 
Reviews - Trends in Genetics, Vol. 1(1) 1986 and Mol et al., 1990 FEBS Letters 268:427- 
430. The term antisense RNA is intended to also cover double stranded interfering RNAs 
(RNAi), which induce selective degradation of the RNAs complementary to one of the two 
dsRNA strands. 

[0093] Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms "host cell" and 

"recombinant host cell" are used interchangeably herein. It is understood that such terms 

refer not only to the particular subject cell but they also apply to the progeny or potential 

progeny of such a cell. Because certain modifications may occur in succeeding generations 

due to either mutation or environmental influences, such progeny may not, in fact, be 

identical to the parent cell, but are still included within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a Rfo can be expressed in 
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bacterial cells such as C. glutamicum, insect cells, fungal cells or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells), algae, ciliates, plant cells, fungi or other 
microorganisms like C. glutamicum. In a preferred embodiment, the host cell is a plant cell, 
more preferably, a Brassica napus plant cell, and most preferably, a stamen or anther cell. 
[0094] Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation", "transfection", "conjugation" and "transduction" are intended to refer to a 
variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a 
host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran- 
mediated transfection,- lipofection, natural competence,-chemical-mediated transfer and 
electroporation. Suitable methods for transforming or transfecting host cells including plant 
cells can be found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2 nd , ed., 
Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY, 1989) and other laboratory manuals such as Methods in Molecular Biology, 1995, Vol. 
44, Agrobacterium protocols, ed: Gartland and Davey, Humana Press, Totowa, New Jersey. 
[0095] In particular, the invention provides a method of producing a transgenic plant with 
a Rfo coding nucleic acid, wherein the plant is a cytoplasmic male-sterile plant and wherein 
expression of the nucleic acid in the plant results in restoration of fertility of the plant 
comprising: (a) transforming a plant cell with an expression vector comprising a Rfo nucleic 
acid, and (b) generating from the plant cell a transgenic plant. The plant cell includes, but is 
not limited to, a gamete producing cell, a protoplast and any other cell that regenerates into a 
whole plant. 

[0096] For such plant transformation, binary vectors such as pBinAR can be used 
(Hofgen and Willmitzer, 1990 Plant Science 66:221-230). Construction of the binary vectors 
can be performed by ligation _of the cDNA in sense or antisense orientation into the T-DNA. 
5-prime to the cDNA a plant promoter activates transcription of the cDNA. A 
polyadenylation sequence is located 3-prime to the cDNA. Tissue-specific expression can be 
achieved by using a tissue specific promoter. For constitutive expression within the whole 
plant, the CaMV 35S promoter can be used. The expressed protein can be targeted to a 
cellular compartment using a signal peptide, for example for plastids, mitochondria or 
endoplasmic reticulum (Kermode, 1996 Crit. Rev. Plant Sci. 4 (15):285-423). The signal 
peptide is cloned 5-prime in frame to the cDNA to archive subcellular localization of the 
fusion protein. One skilled in the art will recognize that the promoter used should be 
operatively linked to the nucleic acid such that the promoter causes transcription of the 
nucleic acid and results in the synthesis of a mRNA which encodes a polypeptide. 
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Alternatively, the RNA can be an antisense RNA for use in affecting subsequent expression 
of the same or another gene or genes. 

[0097] Alternate methods of transfection include the direct transfer of DNA into 
developing flowers via electroporation or Agrobacterium mediated gene transfer. 
Agrobacterium mediated plant transformation can be performed using for example the 
GV3101(pMP90) (Koncz and Schell, 1986 Mol. Gen. Genet. 204:383-396) or LBA4404 
(Clontech) Agrobacterium tumefaciens strain. Transformation can be performed by standard 
transformation and regeneration techniques (Deblaere et al., 1994 Nucl. Acids. Res. 13:4777- 
4788; Gelvin, Stanton B. and Schilperoort, Robert A, Plant Molecular Biology Manual, 2 nd 
— Ed. - DordrechL:.Kluwer Academic Publ., 1995. - in-Sect., Ringbuc Zentrale Signatur: BT1 1- _ 
P ISBN 0-7923-2731-4; Glick, Bernard R.; Thompson, John E., Methods in Plant Molecular 
Biology and Biotechnology, Boca Raton : CRC Press, 1993 360 S., ISBN 0-8493-5164-2). 

[0098] A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) a Rfo. Accordingly, the invention further 
provides methods for producing Rib proteins using the host cells of the invention. In one 
embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector encoding a Rfo protein has been introduced, or into which 
genome has been introduced a gene encoding a wild-type or altered Rfo protein) in a suitable 
medium until Rfo protein is produced. In another embodiment, the method further comprises 
isolating Rfo proteins from the medium or the host cell. 

[0099] Another aspect of the invention pertains to isolated Rfo proteins, and biologically 
active portions thereof. An "isolated" or "purified" protein or biologically active portion 
thereof is free of some of the cellular material when produced by recombinant DNA 
techniques, or chemical precursors or other chemicals when chemically synthesized. The 
language "substantiallyjEree of cellular material" includes preparations of Rfo protein in 
which the protein is separated from some of the cellular components of the cells in which it is 
naturally or recombinantly produced. In one embodiment, the language "substantially free of 
cellular material" includes preparations of a Rfo protein having less than about 30% (by dry 
weight) of non-Rfo protein material (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-Rfo protein material, still more preferably less 

* 

than about 10% of non-Rfo protein material, and most preferably less than about 5% non-Rfo 
protein material. 

[00100] When the Rfo protein or biologically active portion thereof is recombinantly 
produced, it is also preferably substantially free of culture medium, i.e., culture medium 
represents less than about 20%, more preferably less than about 10%, and most preferably 
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less than about 5% of the volume of the protein preparation. The language "substantially free 
of chemical precursors or other chemicals" includes preparations of Rfo protein in which the 
protein is separated from chemical precursors or other chemicals that are involved in the 
synthesis of the protein. In one embodiment, the language "substantially free of chemical 
precursors or other chemicals" includes preparations of a Rfo protein having less than about 
30% (by dry weight) of chemical precursors or non-Rfo protein chemicals, more preferably 
less than about 20% chemical precursors or non-Rfo protein chemicals, still more preferably 
less than about 10% chemical precursors or non-Rfo protein chemicals, and most preferably 
less than about 5% chemical precursors or non-Rfo protein chemicals. In preferred 
embodimentsr isolated proteins, or biologically active_portions thereof, lack contaminating 
proteins from the same organism from which the Rfo protein is derived. 

[00101] The present invention also provides antibodies that specifically bind to a Rfo 
protein, or a portion thereof, as encoded by a nucleic acid described herein. Antibodies can 
be made by many well-known methods (See, e.g. Harlow and Lane, "Antibodies; A 
Laboratory Manual" Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 
(1988)). Briefly, purified antigen can be injected into an animal in an amount and in intervals 
sufficient to elicit an immune response. Antibodies can either be purified directly, or spleen 
cells can be obtained from the animal. The cells can then fused with an immortal cell line 
and screened for antibody secretion. The antibodies can be used to screen nucleic acid clone 
libraries for cells secreting the antigen. Those positive clones can then be sequenced. (See, 
for example, Kelly et al., 1992 Bio/Technology 10:163-167; Bebbington . et al., 1992 
Bio/Technology 10:169-175). 

[00102] The phrases "selectively binds'' and "specifically binds" when referring to binding 
to a polypeptide refer to a binding reaction that is determinative of the presence of the protein 
in a heterogeneous population of proteins and other biologies. Thus, under designated 
immunoassay conditions, the specified antibodies bound to a particular protein do not bind in 
a significant amount to other proteins present in the sample. Selective binding of an antibody 
under such conditions may require an antibody that is selected for its specificity for a 
particular protein. A variety of immunoassay formats may be used to select antibodies that 
selectively bind with a particular protein. For example, solid-phase ELISA immunoassays 
are routinely used to select antibodies selectively immunoreactive with a protein. See 
Harlow and Lane "Antibodies, A Laboratory Manual" Cold Spring Harbor Publications, New 
York, (1988), for a description of immunoassay formats and conditions that could be used to 
determine selective binding. 
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[00103] In some instances, it is desirable to prepare monoclonal antibodies from various 
hosts. A description of techniques for preparing such monoclonal antibodies may be found in 
Stites et al., editors, "Basic and Clinical Immunology/' (Lange Medical Publications, Los 
Altos, Calif., Fourth Edition) and references cited therein, and in Harlow and Lane 
("Antibodies, A Laboratory Manual" Cold Spring Harbor Publications, New York, 1988). 
[00104] The nucleic acid molecules, proteins, protein homologs, fusion proteins, primers, 
vectors, and host cells described herein can be used in one or more of the following methods: 
producing a hybrid plant that contains the nucleic acid molecules of the present invention, 
modulating fertility in a plant, increasing production of viable pollen in a cytoplasmic male- 
sterile-plant, restoring fertility in a cytoplasmic male-sterile plant, identifying Raphanus 
sativum and related" organisms, mapping of genomes of organisms related to Raphanus 
sativum, identifying and localizing Raphanus sativum sequences of interest and pei?/brming 
evolutionary studies. 

[00105] Accordingly, the present invention provides a method of producing a hybrid plant, 

comprising crossing a male-fertile plant containing the Rfo nucleic acids of the present 

invention with a male-sterile plant, collecting hybrid seed from the male-sterile plant and 

regenerating the hybrid plant from the seed. In a preferred embodiment, the male-sterile 

plant contains an ogu cytoplasmic male sterility determinant. The present invention also 

provides a method of modulating the fertility of a transgenic plant that includes expressing a 

Rfo nucleic acid in the plant. Preferably, the plant comprises a male-sterility determinant and 

expression of the nucleic acid sequence in the plant results in increased production of viable 

pollen by the plant. In a preferred embodiment, expression of the nucleic acid sequence in 

the plant results in restoration of fertility of the plant. In a more preferred embodiment, the 

plant contains an ogu cytoplasmic male sterility determinant. The present invention describes 

using the expression of Rfo of Raphanus sativum to restore fertility in male-sterile plants. 

The invention also provides a transgenic plant containing a Rfo nucleic acid or a fragment 

thereof, wherein the plant has increased fertility or viable pollen production as compared to a 

cytoplasmic male-sterile plant of the same variety. The transgenic plant can be a monocot or 

a dicot. The invention further provides that the transgenic plant can be selected from maize, 

wheat, rye, oat, triticale, rice, barley, soybean, peanut, rapeseed, canola, pepper, sunflower, 

Vicia species, pea, alfalfa, bushy plants (coffee, cacao, tea), Salix species, radish, sorghum, 

pearl millet, cotton, and tobacco. In a preferred embodiment, the transgenic plant is a canola 

plant selected from the group consisting of Brassica napus, Brassica rapa (ox campesfris), 

Brassica oleracea, Brassica nigra, Brassica juncea, Sinapis alba, and Brassica carinata. In 

a more preferred embodiment, the male-sterile plant is Brassica napus. 
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[00106] The present invention also allows for the production of a true breeding variety of 
plants that are capable of restoring male fertility in a Fl hybrid descendant of a plant of the 
present invention and a male-sterile plant. This type of true breeding variety of a fertility 
restorer plant is also termed a "restorer line". The terms "variety" and "line" refer to a group 
of plants within a species that share constant characters that separate them from the typical 
form and from other possible varieties within that species. While possessing at least one 
distinctive trait, a variety or line is also characterized by some variation between individuals 
within the variety or line, based primarily on the Mendelian segregation of traits among the 
progeny of succeeding generations. A variety or line is considered "true breeding" for a 
particular trait if it is genetically homozygous for that trait to the extent that, when the^ true- 
breeding variety or line is self-pollinated, a significant amount of independent segregation of 
the trait among the progeny is not observed. In the present invention, the trait arises from the 
transgenic expression of a single DNA sequence introduced into a plant variety or plant line. 
[00107] In addition to introducing the Rfo nucleic acids into transgenic plants, these 
sequences can also be used to identify a plant as being Raphanus sativum or a close relative 
thereof. Also, they may be used to identify the presence of Raphanus sativum or a relative 
thereof in a mixed population of plants. The invention provides the nucleic acid sequences of 
a number of Raphanus sativum genes; by probing the extracted genomic DNA of a culture of 
a unique or mixed population of plants under stringent conditions with a probe spanning a 
region of a Raphanus sativum gene which is unique to this plant, one can ascertain whether 
this plant or the genus is present. 

[00108] More importantly, the Rfo nucleic acids can be used to isolate Rfo homologs in 
other species. The nucleotide sequences determined from the cloning of the Rfo genes from 
Raphanus sativum allow for the generation of probes and primers designed for use in 
identifying and/or cloning Rfo homologs in other cell types and plants such as Brassica 
napus, as well as Rfo homologs from other radishes and related species. 

[00109] Further, the nucleic acid and protein molecules of the invention may serve as 
markers for specific regions of the genome. This has utility not only in the mapping of the 
genome, but also in functional studies of Raphanus sativum proteins. For example, to 
identify the region of the genome to which a particular Raphanus sativum DNA-binding 
protein binds, the Raphanus sativum genome could be digested, and the fragments incubated 
with the DNA-binding protein. Those fragments that bind the protein may be additionally 
probed with the nucleic acid molecules of the invention, preferably with readily detectable 
labels. Binding of such a nucleic acid molecule to the genome fragment enables the 
localization of the fragment to the genome map of Raphanus sativum, and, when performed 
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multiple times with different enzymes, facilitates a rapid determination of the nucleic acid 
sequence to which the protein binds. Further, the nucleic acid molecules of the invention 
may be sufficiently homologous to the sequences of related species such that these nucleic 
acid molecules may serve as markers for the construction of a genomic map in related 
radishes. 

[00110] The nucleic acid and protein molecules of the invention may also be used as 
markers to identify regions of the .fl/b-radish genome as being tightly genetically-linked to the 
Rfo gene. For example, DNA sequences given in SEQ ID NO:87 could be used as probes to 
identify restriction fragment length polymorphisms (Tanksley et al. (1987) In: Chromosome 
structure and function. Plenum Press N.Y. ppl57-173) that are tightly genetically linked to 
the Rfo gene. These DNA markers could therTbe used to identify individuals in plant 
breeding programs that might have a minimal amount of radish DNA in the region flanking 
Rfo and which therefore would be highly useful for the development of effective, low 
glucosinolate B. napus restorer lines. Similarly, these sequences could be used to design 
PCR based DNA markers, such as SNPs (The International SNP Map Working Group (2001) 
Nature 409: 928-933) and SSRs (Tautz D (1989) Nucl Acids Res 17: 6463-6471) that could 
be used in simlar fashion. 

[00111] The nucleic acid molecules given in SEQ ID NO:87 that function as restorer genes 
could also be used as selection markers to identify transformed plant cells. For example, a 
sequence that functions as a restorer gene could be combined with another gene of interest in 
a transformation vector. The vector could then be introduced into plant cells by any of a 
number of methods such as Agrobacterium-mediated transformation. If the plant cells into 
which the DNA is introduced are cytoplasmically male sterile, the anthers that form on the 
regenerated plants will normally not produce any pollen and no seed will form from these 
flowers by self-pollination. If, however, the anthers form from cells that have acquired a 
restorer gene and the associated gene of interest, the anthers will produce pollen and these 
flowers will form seeds. This provides a powerful selection system for the identification of 
transformed plants or parts of plants. The use of restorer genes, which are purely normal 
plant genes, as selection markers may have some advantages over other currently used 
selection markers with respect to regulatory issues. The use of restorer genes as selection 
markers is not intended to be limited to the ogu CMS system and Rfo 9 but rather could apply 
to any restorer gene for any CMS system in any plant species. 

[00112] The Rfo nucleic acid molecules of the invention are also useful for evolutionary 

and protein structural studies. By comparing the sequences of the nucleic acid molecules of 

the present invention to those encoding similar proteins from other organisms, the 
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evolutionary relatedness of the organisms can be assessed. Similarly, such a comparison 
permits an assessment of which regions of the sequence are conserved and which are not, 
which may aid in determining those regions of the protein that are essential for the 
functioning of the Rfo proteins. This type of determination is of value for protein 
engineering studies and may give an indication of what the protein can tolerate in terms of 
mutagenesis without losing function. 

[00113] Throughout this application, various publications are referenced. The disclosures 
of all of these publications and those references cited within those publications in their 
entireties are hereby incorporated by reference into this application in order to more fully 
describe the state of the art to whiclT this invention pertains. It should also be understood that"" 
the foregoing relates to preferred embodiments of the present invention and that numerous 
changes may be made therein without departing from the scope of the invention. The 
invention is further illustrated by the following examples, which are not to be construed in 
any way as imposing limitations upon the scope thereof. On the contrary, it is to be clearly 
understood that resort may be had to various other embodiments, modifications, and 
equivalents thereof, which, after reading the description herein, may suggest themselves to 
those skilled in the art without departing from the spirit of the present invention and/or the 
scope of the appended claims. 

EXAMPLES 

Materials and Methods 

Plant growth, DNA isolation, marker analyses and library construction 
[001 14] Radish plants were grown in growth chambers with a 16 hour photoperiod cycle, 
at 20°C day and 15°C night. To self- fertilize these plants, a bud that was about to open was 
teezed apart and its anthers and petals removed. An anther from another flower on the same 
plant was picked and used for pollination. The pollen was applied directly to the stigma. This 
procedure was repeated on 3 or 4 buds of the same branch. Any nearby open flowers were 
detached. A white paper crossing bag was used to cover the buds. The plants were then 
placed in a greenhouse to allow for seed maturation. Fresh leaf material was used for DNA 
extraction. DNA extraction, restriction enzyme digestion, Southern blotting and RFLP 
analysis were carried according to Cheung WY et al (1997, Theor Appl Genet 95:73-82). 
Small scale DNA preparations suitable for PCR analysis were according to Cheung WY, 
Hubert N, Landry BS (1993). PCR Methods Applic 3:69-70. 

[00115] Genomic DNA from radish plants homozygous for the Rfo restorer locus was 

isolated, and BAC library construction and preparation of probes from BAC ends was 
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according to Woo SS, Jiang J, Gill BS, Paterson AH, Wing RA (1994) Nucleic Acids Res 22: 
4922-4931. High density colony filter of BAC clones were generated and screened according 
to the procedure of Clemson University Genomic Institute (Tornkins JP, Mahalingham R, 
Smith H, Goicoechea JL, Knap HT, Wing R (1999) Plant Mol Biol 41: 25-32). A genomic 
library from a radish plant homozygous for the restorer was constructed in the fosmid vector 
pFOSl (New England Biolabs) according to the manufacturer's instructions. The library was 
screened by successive rounds of colony lifts as described in Sambrook et al (1989) 
Molecular Cloning: A laboratory Manual. 2 nd ed. Cold Spring Harbor laboratory Press, Cold 
Spring Harbor, NY. 



BAC and fosmid sub-cloning and sequencing 

[00116] BAC and fosmid DNA was isolated from exponentially growing E. coli 
(Electromax DH10B, Gibco) cells using a Qiagen Large-Construct Preparation Kit. The DNA 
was sheared to a size of approximately two to six kb with a Hydroshear (GeneMachines) 
according to manufacturer's recommendations. The ends of the sheared fragments were 
blunted using Gibco T4 DNA polymerase. The pBluescriptll phagemid (Stratagene) was used 
as a vector to sub-clone the blunted sheared DNA fragments. To prepare for ligation, the 
vector was digested with the restriction enzyme EcoKV (Amersham-Pharmacia Biotech) and 
dephosphorylated with Calf Intestine Phosphatase (CIP, New England Biolabs). The vector 
was then separated on a 0.7% agarose gel, excised and gel purified using the Sephaglas Kit 
(Amersham-Pharmacia Biotech). This digested, dephosphorylated and gel purified vector was 
ligated to the blunted sheared DNA fragments using the T4 DNA ligase (Gibco). After an 
overnight incubation at 16 deg. C, the ligation mixture was chemically transformed into 
E.coli (XLl-Blue, Stratagene) as described in The NEB Transcript 6: (1) 7. Recombinants 
plasmid DNA was isolated with an Autogen 740, restricted with BamRI-XJtoI (Amersham- 
Pharmacia Biotech) enzymes and separated on 0.8% agarose gels to determine insert size. 
Isolated plasmids were also sequenced with Li-COR DNA sequencers LONG-READIR 4200 
according to manufacturer's recommendations. 

Assembly of BAC and fosmid sequences 

[00117] Raw trace files were obtained in SCF format from one often Li-COR sequencers. 
Read lengths of approximately 800bp were obtained for each of the forward and reverse 
strands from a particular clone. The trace files were collected in a single directory on a 
UNDC server. The Staden software package was installed and configured on a Sun 420R 
server. 
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[00118] Pregap4 was used to process the SCF files and generate EXP files, which were 
assembled into a contiguous sequence. Phred was used to produce a quality score for each 
base within each read. The raw sequence was clipped according to a minimum quality score 
of 15 as produced through the use of Phred. Cross match was used to screen out any 
contaminating cloning vector sequence. Similarly Cross Match was used to mask any 
sequencing vector found in the raw sequence. RepeatMasker was used to mask any region 
that contained repetitive sequence, which may skew the assembly process. Once processed, 
the resulting EXP files were assembled using the Phrap program. Assembly was performed 
with a minimum exact match value of 12 and a minimum SWAT score of 30. 
[001-19]- The resulting assembly produced~a preliminary sequence assembly, which was 
analyzed with the Gap4 software package. A minimum of five-fold coverage was deemed to 
be acceptable in considering a particular region of sequence completed and correct. The 
assembly was meticulously analyzed for incorrect and erroneous sequence position manually. 
Chimeric clones were discovered and removed. These were visible using the. known insert 
size of a particular clone and comparing the position of the reverse and forward reads in the 
assembly. Primers were designed using the Oligo 6 software (Molecular Biology Insights) in 
regions that contained potential mismatches or discrepancies. PGR amplifications were 
perforrmed, amplification products were cloned and the resulting sequences were assembled 
and the correctness of the sequence determined. 

[00120] Further evidence of sequence correctness was provided by restriction enzyme 
profiles of the individual BAC and fosmid clones and the corresponding consensus sequence. 
The use Gap4 Software and the recognition sites for EcoRl, EcoRV and HindUl produced an 
expected pattern of size fragments. These were then compared to restriction analysis 
performed in the laboratory, which concurred the correct sequence. 

Sub-cloning and plant transformation 

[00121] Constructs containing the Rfo genes of interest are introduced into Brassica napus 
as described below. B. napus seed are sterilized in 20% commercial bleach for 30 minutes 
with shaking. The seeds are then washed 5 times in a sterile funnel and filter paper with 
sterile distilled water. The seeds (~20 per plate) are placed on Bn I (seed germination 
medium, see below), wrapped in Parafilm and incubated at 25°C for 5 days. Agrobacteriwn 
with the plasmid of interest is inoculated into 5 ml LB + Kanamycin 100 (100 mg/L) + 
Gentamycin 25 ( 25mg/L) ( for clones in pRD400 vector) or 5 ml LB + tetracyclin (2.5 mg/L) 
+ Gentamycin 25 (25 mg/L) ( for clones in pOCA18 vector)and incubated at 28°C with 

shaking for 24 hours. 50 jjL of this culture is transferred to 5 mL of LB with the appropriate 
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antibiotics as decribed above and incubated at 28°C with shaking for 24 hours. 5 mL of this 
culture is removed and recovered by centrifugation. The cell pellet is resuspended in 1 mL of 
MS media (no antibiotic), diluted 1 : 1 00 in MS media and used for plant transformation. 
[00122] For transformation, the petiole of 4 to 5 day old Brassica cotyledons are cut with a 
sterile scalpel. The cut surface is dipped in the diluted Agi-obacterium suspension for 1 
second and the cotyledons on are placed on Bn II (co-cultivation) medium by pushing the 
petiole into the soft agar. 20 cotyledons are placed on each plate. The plates are wrapped in 
Parafilm and incubated for 3 days at 25°C in a growth room under constant illumination. 
After 2 days, the cotyledons are transferred immediately to Bn IV (selection regeneration) 
^medium if Agrobacterium growth-around the petiole is noted. Otherwise, the cotyledons are 
transferred to Bn IV (selection regeneration) medium after 3 days. Upon transfer, the plates 
are sealed with Parafilm and incubated at 25°C under constant illumination. Cotyledons are 
transferred to fresh plates after 7 days and again every 10 days after that. At this stage a 
maximum of 10 cotyledons are placed on each plate. Callus formation at the tip of the petiole 
occurs within several weeks. 

[00123] At this time, bleached shoots are removed. Putative transgenic shoots that arise 
from the callus are green and may look vitrified. These are excised and transferred to Bn V 
shoot elongation medium. Attached calli are removed from the shoot. In Bn V media some 
shoots will develop roots at which point the plantlets can be transferred to soil. After several 
weeks on Bn V media, if the shoots have not developed roots, they are transferred to Bn VI 
(rooting) media. Once roots have formed, the plantlets can be transferred to soil. If rooting 
seems not to occur after several transfers, the plantlets may be put directly into soil. 
[00124] If the B. napus seed used in the transformation procedure above was an ogu CMS 
fine, the recovery of male fertile plants is indicative of the presence of the Rfo gene in the 
introduced DNA. The transgenic plant could also be screened for an ability to restore male- 
fertility by crossing the transgenic plant with a male-sterile plant and determining if the Fl 
generation hybrid is male-fertile. A determination as to whether male-fertility is restored in a 
plant can be made, for example, by 1) visually assessing an increase in the production of 
pollen as compared to a male-sterile plant or 2) determining that the plant can self-fertijize as 
evidenced by placing a bag over a flower on the plant and finding an increase of seed therein 
as compared to a male-sterile plant. 
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Media 

Bn I: Murashige-Skoog (MS) minimal medium (Sigma), 3% sucrose, pH 5.8 
Bn II: MS, 3% sucrose, 4.5 mg/L Benzyladenine (BA), pH 5.8 
Bn IV: MS, 3% sucrose, 4.5 mg/L BA, 20 mg/L Kanamycin (Km), 500 mg/L 
300mg/L Timentin (Tn), pH 5.8 

Bn V: MS, 3% sucrose, 20 mg/L Km, 300 mg/L Tn, pH 5.8 Bn VI: MS, 3% 

sucrose, 2 mg/L Indole butyric acid (IB A), 300 mg/L Tn, pH 5.8. 

All of the media contain 0.7% w / v phytagar. 

LB: lOg/L Tryptone, 5g/L yeast extract, lOg/L NaCl, pH 7.0 



Analysis of transgenic plants 

[001251 Individual transgenic plants were raised to maturity and visually assessed for male 
fertility/sterility. Small samples of leaf DNA were taken from the plants and analyzed by 
PCR using primers for specific genes and for the NPTII plant selection maker in the vectors. 
Gene 16 and Gene 26 transgenic plants were further analyzed for the vector copy number and 
integrity by Southern blot analysis. Only plants testing positive for the NPTII, introduced 
gene and orfl38 are listed in Table 1. 

RACE Analysis 

[00126] Total RNA from fertile radish plants and fertile Brassica transgenic plants were 
extracted at flowering stage. The Gene specific primers were designed from the predicted 
exons of Genes 16 and Gene 26. A SMART™ RACE cDNA amplification kit (Clontech) was 
used to generate 5 5 and 3' RACE products. The PCR products were purified and sequenced 
directly. 

Scientific approach 

[00127] A strategy termed positional or map-based cloning, that allows the identification 

and isolation of genes solely on the basis of the phenotype they confer, was employed to 

isolate these genes. This strategy is based on the principle of genetic mapping. Populations 

are developed in which the gene of interest, in this case the Rfo restorer gene, is segregating 

genetically. In the present example, mapping populations were developed by crossing a 

radish restorer line (a line that is homozygous for the Rfo locus) with a radish Ogura CMS 

line lacking the restorer gene (or homozygous for the recessive Rfo allele of the restorer 

locus). The resulting Fl generation plants were then self-fertilized to create F2 populations. 

Individual F2 plants heterozygous for the restorer gene were self-fertilized to create F3 
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populations. F4 and F5 populations were developed by self-fertilizing F3 and F4 plants, 
respectively. 

[00128] Markers that detect differences in the original parent plants are used to track the 
chromosomal segments from each parent that are transmitted to the different progeny plants. 
In the current case, markers employed were DNA markers, which directly detect DNA 
sequence differences between the parental chromosomes. The restorer gene itself can be 
tracked by its phenotype; plants with the ogu male sterile cytoplasm that lack the restorer 
gene are male sterile (Fig. IB) and possess small sterile stamens that are clearly 
distinguishable from those of male fertile plants with the ogu cytoplasm that contain the 
restorer gene (Fig. 1 A). Rfo is-a dominant gene, meaning that a plant need inherit only one 
copy of the Rfo locus to be male fertile; plants heterozygous for Rfo are indistinguishable 
from plants homozygous for Rfo. Plants lacking a copy of Rfo (or that are homozygous for 
the Rfo recessive allele) with the male sterile cytoplasm are sterile. Chromosomal maps can 
be generated that are based on the 'frequency with which the markers from a particular parent 
are co-inherited in progeny plants: the more frequently two markers are inherited together, 
the closer they are located to one another on the chromosome. 

[00129] For markers very close to the restorer gene, at least one copy of the allele from the 
fertile parent will almost always be found in fertile plants. It is possible to genetically delimit 
the chromosomal region containing the restorer gene on this basis: if the corresponding 
segments of two homologous chromatids of a fertile plant each carry alleles from the sterile 
parent, that segment cannot contain the restorer gene. Similarly, if a chromosome segment 
in a sterile plant carries alleles of the fertile parent, that segment cannot contain the restorer 
gene. 

[00130] The availability of DNA markers that are genetically tightly linked to a gene of 
interest is the starting point for a map-based cloning strategy. These markers are used to 
isolate genomic DNA clones, which may, in turn be used to select additional genomic clones. 
The individual clones so isolated are grouped into "contigs", sets of overlapping cloned DNA 
segments, until a single contig that spans the genomic region surrounding the gene of interest 
is assembled. By introducing the different portions of the cloned region or contig into plants 
of the appropriate genotype, and subsequently assessing the phenotype of these plants, it is 
possible to identify the specific portion of the contig that contains the gene. To characterize 
the gene, the sequence and expression of this specific portion is analyzed. 
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Genetic localization of the radish Rfo locus 

[00131] Rfo mapping populations were developed using both Asian and European radish 
varieties. Initial studies indicated that B. napus RFLP markers that co-segregated with Rfo in 
B. napus crosses mapped up to 60 cM from one another in radish, consistent with the genetic 
distance separating these markers in B. napus crosses that do not involve Rfo (Delourme R, 
Foisset N, Horcais R, Barret P, Champagne G, Cheung W, Landry BS & Renard M (1998) 
Theor Appl Genet 97: 129-134). These mapping studies were extended by developing F3, F4 
and F5 populations. One of the F4 populations (D26/44 F4) comprised 90 individuals and 
became the focus of subsequent analyses aimed at more extensive targeting and mapping of 
markers to the Rfo region. New RfoAvriked RFLP markers, identified by bulked segregant 
analysis ( Michelmore RW, Paran L KessehRV (1991) Proc Natl Acad Sci USA. 88:9828- 
9832.), were mapped on this F4 population. One marker, DLM82, was found to co-segregate 
with Rfo, while two others, DLM 299 and DLM316, were found to map within 4.6 and 9.2 
cM of Rfo, respectively. Recombination between the DLM 82 marker and Rfo was detected 
through analysis of a second radish F4 population (D63/41 F4) of 89 individuals. The results 
from these mapping studies, summarized in Fig. 2, indicated that markers spanning the Rfo 
region had been identified. 

[00132] A radish plant homozygous for i?/b-associated alleles of all markers in the Rfo 
region was used as the source of genomic DNA to construct a BAC genomic library suitable 
for cloning the gene. A library of 48,000 clones with an average insert size of 43 kb was 
generated and arrayed in 384-well microtiter plates. This library represents the radish 
genome at approximately 4-fold redundancy. To exploit the regional synteny between radish 
and Arabidopsis, Brassica/radish RFLP markers mapping close to Rfo were first hybridized to 
an Arabidopsis BAC library to identify corresponding Arabidopsis clones; these were then 
selected and a contig of overlapping Arabidopsis BACs was constructed using a BAC 
fingerprinting database. Markers derived from these BACs as well as radish RFLP markers 
mapping close to Rfo were used as probes to identify corresponding radish BAC clones. End 
probes from these Arabidopsis and radish BACs were then used to identify additional, 
overlapping BAC clones, and thereby extend the different contigs. 

[00133] Clone DLM82 identified several BAC clones in the Arabidopsis library; end 
probes from one of these detected polymorphism between sterile and fertile radish bulks. 
Starting with this clone, it was possible to assemble an Arabidopsis contig of BACs, which 
represent the segment of the Arabidopsis genome extending from Chromosome 1 nucleotides 
23,391,584 to 23,806,826 (F2202-F1301 1) (Arabidopsis sequence coordinates and BACs 
are listed on the Arabidopsis Genome Initiative website: 
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http://www.arabidopsis.org/agi.html). End probes from clones of this region detected 
polymorphism between the radish bulks, indicating that synteny between radish and 
Arabidopsis was maintained in this region. An Arabidopsis BAC derived end probe LI 2 was 
found to hybridize to two polymorphic radish fragments; these fragments represent two 
linked but independent loci, L12a and L12b. One of these, L12a, co-segregated with B. 
napus marker DLM 82. Two recombination events were found to have occurred in the F4 
mapping population between the LI 2a and LI 2b loci, with LI 2b being located distal to Rfo, 
between the B. napus RFLP markers DLM356 and DLM82. These results are summarized in 
Fig. 2. Probes derived from the ends of other Arabidopsis BACs were found to map within 
the interval defined by LI 2b and DLM82/L12a .^Additional-genetic mappingJn-radish of 
markers derived from this Arabidopsis contig, such as L40 (Fig. 3A) indicated that the 
physical location of the markers was consistent with their genetic position, and hence that co- 
linearity in this region between the Arabidopsis and radish genomes (radish regions rBl-rC 
and Arabidopsis B-C, Fig 3A) was maintained. 

[00134] One Arabidopsis marker located in the vicinity of LI 2, was found, like LI 2, to be 
duplicated in radish, suggesting that a portion of the syntenic Arabidopsis region is duplicated 
in the radish genomic region near Rfo (region rB2, Fig. 3 A). Further analysis of the radish 
population using markers derived from the Arabidopsis BACs indicated a lack of synteny 
between radish and Arabidopsis for markers located to the right of L40 and suggested an 
inversion of markers in the duplicated region (region rB2, Fig. 3A). This further suggested 
that the genomic region in radish containing Rfo might correspond to the Arabidopsis region 
flanking the L12 sequence but distal to the L40 sequence (region A, Fig. 3B). 
[00135] A chromosome walk from the opposite side of Rfo was initiated using the RFLP 
probe DLM299, which detects a small gene family in radish, only one member of which 
appears to be linked to Rfo. DLM299 was used to recover two radish BAG clones, from 
which derived probes detected differences between the bulked homozyogous fertile and 
sterile radish DNA, indicating linkage to Rfo. The sequences which comprise the DLM299 
region recovered in this manner were found to be located within two radish BACs. Probes 
derived from these BACs were used to recover an additional set of radish BACs. 
Interestingly, an end probe from one of these additional BACs (64K20) mapped closer to Rfo 
than did DLM299. 

[00136] Further localization of the Rfo locus was accomplished by developing and 
analyzing a European radish mapping population of 135 individuals, together with an F5 
Asian radish population of 900 individuals. A radish BAC recovered using sequences from 
Arabidopsis region rB2 (11K10) was sequenced and used to derive a CAPs marker 
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(Konieczny A, Ausubel FM (1993) Plant J 4: 403-410) to facilitate analysis of this large 
population. A CAPS marker was also derived using sequence information from BAC 64K10, 
which maps to the opposite side of Rfo. By screening small scale DNA preparations from all 
900 individuals using these markers it was possible to identify individuals in which 
recombination had occurred in the vicinity of Rfo and which were therefore informative for 
more detailed mapping analysis. In addition, the radish BAC contigs were extended by using 
probes derived from Arabidopsis region A (roughly Chromosome 1 nucleotides 23,000,000 
to 23,400,000 in the Arabidopsis genome; see Fig. 3B) to recover additional radish BACs. 
These, together with radish BACs corresponding to region rB2, were used to recover more 
radish BACs and all these were assembled into contigs on the basis of overlapping restriction 
enzyme profiles. Gaps between the contigs were filled by recovering clones from a cosmid 
library of genomic DNA isolated from a plant known to be homozygous for the Rfo region. 
The physical relationship among the clones from a portion of this contig is shown in Fig. 4. 
[00137] BAC and cosmid clones showing complete genetic linkage to Rfo were sequenced. 
In total, a sequence of over 270 kb, representing a minimum of 5 -fold coverage of each 
genomic region, was assembled and annotated. The result of the sequence analysis of this Rfo 
region is provided in the Appendix as SEQ ID NO:87. The sequence was found to contain 43 
putative (or predicted) genes. Those 43 putative gene products and genes, and gene products, 
are provided in the Appendix herein as SEQ ID NOS: 1 -86. 

[00138] The European radish population segregating for Rfo was found to be 
monomorphic for markers derived from the region containing genes 31 through 41 of the 270 
kb sequence. Since this population segregated for the restorer gene, this observation 
delimited one boundary of the potential Rfo coding region and eliminated predicted genes 31 
through 41 as candidates. This region was also eliminated by a sterile plant in the Asian F5 
population, which was homozygous for the sterile parent alleles for markers in the region 
through gene 30, but heterozygous for an allele derived from gene 31. The Asian radish F5 
population also contained a sterile plant that was homozygous for sterile parent alleles 
derived from the region extending from gene 16 through 30, but heterozygous for alleles 
from gene 9 through 15 (genes 1 through 9 were not examined). Accounting for the 
possibility of recombination within restriction fragments that define the RFLPs for these 
alleles, these observations indicated together that the Rfo locus resided within the region 
containing predicted genes 14 through 30. 
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Transformation of genes from the Rfo coding region into Ogura CMS B. napus 
[00139] Plant transformation experiments were conducted to determine the capacity of the 
various predicted genes to act as restorers of Ogura CMS in B. napus. This test was based on 
the premise that transformation of CMS plants with genomic DNA spanning a restorer gene 
should result in the recovery of male fertile transgenic plants. The genes were sub-cloned 
from clones, individually or in combination with flanking genes, into binary transformation 
vectors, and introduced into Ogura CMS B. napus plants by Agrobacterium-mediated 
transformation. The genes in the Rfo containing region and the various subclones used in 
transformation experiments are shown in Fig. 5. The region roughly corresponds to the 
region extending from the left endof clone 50E19 to the right end of clone 12M3 in Fig. 4. 
[00140] The results of these transformation experiments are summarized in Table 1 r~ 

Table 1. Plant transformation results 

Recovered plants 



Construct 


Vector 


Genets") 


Fertile 


Sterile 


P12-33 


pRD400 


14 


0 


1 


PI 1-92 


pOCA18 


14, 15 


0 


1 


Pl-Sh20 


pRD400 


15 


0 


23 1 


P2-Sh3 1 


pRD400 


16 


2 


5 


P24-Sh23 


pRD400 


17 


0 


9 1 


PI 1-76 


pOCA18 


16,17,18,19 


0 


2 


PI 1-96 


pOCA18 


15, 16, 17, 


0 


« 

1 






18, 






P3-Sh8 


pRD400 


18 


0 


2 


Pl-11 


pOCA18 


19 


0 


1 


PI 1-66 


pOCA18 


20 


0 


4 


A4-112 


pOCA18 


21, 22 


0 


l 2 


PEP-3 


pRD400 


24 


0 


5 


Bgl-5 


pRD400 


26 


1 


0 


KH8 


pRD400 


28, 29 


0 


1 


'Two of these plants had a few branches with fertile 


flowers. 



2 

Plant had some fertile and some sterile flowers. 
[Please take out the rows for A4-134 and KE7-3] 

[001 41 J Completely male fertile plants were recovered following transformation with 
clones P2-Sh31 and Bgl-5, which contain the predicted genes, Gene 16 and Gene 26, 
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respectively (Fig. 6). These male fertile plants showed no phenotypic alterations apart from 
the change in floral phenotype from Ogura CMS (left panel, Fig. 6) to the completely male 
fertile transgenically restored phenotype (right panel, Fig. 6). The transgenically restored 
flowers were indistinguishable from normal male fertile B. napus flowers. These 
observations indicate that Genes 16 and Gene 26 can each be used to derive new restorer 
lines from Ogura CMS lines that will be free from phenotypic abnormalities, including the 
high seed glucosinolate character. 

[00142] Partially male fertile plants in which one or more branches contained male fertile 
flowers were recovered in transformants with three other constructs. Interestingly, one of 
these constructs (pl-Sh20)~15ontained a gene that showed a high degree of similarity with 
Gene 16 (Gene 15, see below), while another (P24-Sh23) overlapped with P2-Sh31 and 
contained a portion of the Gene 16 coding sequence (see below). Only completely male 
sterile plants were recovered with most of the introduced cloned DNAs. All the transgenic 
plants recovered in these experiments were analyzed by PCR using primers specific for the 
introduced gene or genes, the NPTEI transformation marker, and the mitochondrial gene 
or/138, which is unique to the Ogura male sterile cytoplasm. 

[00143] Only plants that tested positive for all the transgenic sequences are listed in Table 
1 . For the Gene 16 and Gene 26 plants, these PCR analyses were confirmed by Southern blot 
analysis. Southern blot analysis also indicated that the fertile Gene 16 plants had two 
different transgene insertion events, each of which contained one or two copies of the 
introduced genes. Southern analysis of the fertile Gene 26 plant indicated that it had one 
transgene insertion site. For all completely fertile transformants additional PCR tests 
confirmed that borders of the inserted sequence were intact and matched those in the vector. 
Male fertile flowers of the plants were capable of setting seed upon selfing, and hence were 
-female, as well as male, fertile. 

[00144] The specificity of the orfl38 primers was examined by testing their capacity to 

amplify a product from normal male fertile B. napus plants. No such product was observed 

in plants without the Ogura cytoplasm. This shows that the male fertile character of the Gene 

16 and Gene 26 transformants, from which an orfl38 PCR product was obtained indicating 

the presence of the Ogura cytoplasm, is not due to an artifact arising from contaminating 

male fertile cytoplasm seeds in the seed lot used to generate the explants for the 

transformation experiments. To rule out the possibility that mtDNA rearragements involving 

only a portion of the mtDNA might be responsible for the male fertility of the Gene 16 

transformed plants, the floral mtDNAs of these plants were isolated and further analyzed by 

restriction enzyme digestion. The mtDNA restriction digestion pattern was identical to that 
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of the Ogura CMS parental plants used for the transformation experiments, (data not shown) 
indicating that no such rearrangement took place during the transformation/regeneration 
process. Collectively, the analysis of the plants transformed with different portions of the 
potential Rfo coding region indicates that at least two genes in this region are each capable of 
fully restoring fertility to Ogura CMS B. napus plants, Gene 16 and Gene 26, as defined 
herein. 

Characterization of Gene 16 

[00145] The annotation of the approximately 270 kb sequence (SEQ ID NO:87) obtained 
using the program GenScan revealed the presence, in the region contained on construct P2- 
Sh31, of a putative open reading frame starting at nucleotide 103,375 and extending to 
nucleotide 105,589, with a single intron extending from nucleotides 104,498 to 104,588. 
This Gene 16 ORF (SEQ ID NO:32) is predicted to encode a protein of 707 amino acids 
(SEQ LD NO:31). To confirm the gene prediction, 5' and 3' Rapid Amplification of cDNA 
Ends (RACE) analyses were performed on RNA isolated from radish plants homozygous for 
either the fertile or sterile alleles of genes throughout the entire Rfo region, including Gene 
16. These analyses allowed identification of the sequences present in a full length cDNA of 
Gene 16 and hence unambiguous assignment of intron locations and the 5' and 3' mRNA 
ends. A single product was obtained from both the sterile and fertile allele plants with both 
5' and 3'RACE (Fig. 7). The 5' RACE products of the sterile and fertile alleles were similar 
in size, whereas the 3' RACE product from the sterile allele was slightly smaller than the 
corresponding product from fertile plants. 

[00146] DNA sequence analysis of the fertile allele products indicated that the 5' end of 
the mature Gene 16 RNA mapped at or around nucleotide 102,847, and that the 3' end 
mapped at nucleotide 105,837. Sequence analysis of RT-PCR products confirmed the 
presence of the predicted intron and also revealed the presence of an additional intron 
upstream of the start codon that extended from nucleotides 103,036 to nucleotide 103,353. 
The structure of the Gene 16 mRNA is illustrated in Fig. 8. This Figure also illustrates the 
overlap between the Gene 16 clone and the adjacent clone (P24-Sh23) used in transformation 
experiments that gave rise to two plants with some male fertile flowers. 

[00147] The predicted amino acid sequence of the Gene 16 polypeptide (Genel6p) product 

is identical to that provided in the Appendix (SEQ ID NO:31), since the ATG start codon lies 

downstream of the first, unpredicted intron, and is shown in Fig. 9. The polypeptide shows a 

relatively high degree of sequence similarity to the Gene 15 product (Fig. 10) (SEQ ID 

NO:29), and lower degrees of similarity with its Arabidopsis ortholog (F2K11.10) and the 
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ortholog of Gene 15 (F2K11.11), as well as two paraloguous sequences in the Arabidopsis 
genome (F14N23.29 and T30E16.23). The biological functions of the Arabidopsis genes and 
proteins are not known. The similarity with these putative plant proteins lies mainly towards 
the C terminus. 

[00148] Because the products of restorer genes are expected to act in the mitochondria, 
Genel6p for the presence of a potential mitochondrial transit peptide was analyzed (targeting 
presequence) using the program MitoProtn. This analysis indicated that the N terminal 
region of Gene 16 could function as a mitochondrial targeting site, but the probability 
associated with this was relatively low (approx, 20%). However, if translation started at a 
different methionine residue (residue 419) the N terminus of the resulting product would have 
a high probability of serving as a mitochondrial targeting presequence (95%). Other 
examples where the products of alternate translation inititiation events are targeted to 
different sub-cellular sites, including mitochondria, are known (Martin N and Hopper AK 
(1994) Biochimie 76: 1161-1167). It is also possible that the mitochondrial targeting 
information is located at an internal site in the Genel6p, since there are also examples of 
mitochondrial targeting sequences that are located in the interior of polypeptides instead of at 
the N terminus (Schricker R et al, (2002) J Biol Chem in press ). 

[00149] Collectively these analyses indicate that Gene 16 encodes a protein that resembles 
a protein encoded by the adjacent Gene 15 on the radish genome and, to a lesser degree, a 
small family of proteins in the encoded by genes in the Arabidopsis genome. It should be 
noted that some fertile flowers were observed on two plants transformed with the Gene 15 
construct (Table 1), suggesting that this structural homolog can, to a more limited degree, 
functionally overlap with Gene 16 in its capacity to restore male fertility in Ogura CMS B. 
napus plants. 

[00150] A thirty- two amino acid domain was found twice near the N-terminus of gene 16 
and was found also once in genelS. A domain with the consensus 
"GTPNLAAQGTT xTPAxQxYPxMF" (SEQ ID NO:91) was found repeating tandemly nine 
times in gene 16 and seven times in gene 15 near the carboxy terminus of the respective gene 
products. A summary of the two types of repeat domains found in the products of gene 16 
and gene 15 with their consensus sequences and the locations of the repeats. in the repective 
gene products is summarized in Table 2. 



49 



WO 03/006622 



PCT/US02/22217 



Table 2 



Domain consensus 


Gene 


Domain sequences 


Location 
(Amino acid #) 


PVSSE PxQxLGSTSDxSS 


16 


PVSSEPVQPLGSTSDESSGTETTPLAPPPVTT 
(SEQIDNO:93) 


39-70 


GTETTPLAPP xTT 
(SEQ ID NO:92) 


16 


PVSSEQPAQALGSTSDQSSGTETTPLAPPITT 
(SEQ ID NO: 94) 


93-124 




15 


PVSSEPVQRLGSTSDQCSGTHTTPLAPP 
(SEQ ID NO:95) 


39-66 










GTPNLAAQGTT xTPAxQ 
xYPxMF (SEQ ID NO:96) 


16 


GSPNLATYGTTAIPAVQAYAIMF 
(SEQEDNO:97) 


494-516 




16 


GAPNFTSQGTTATPAFQAFPIMF 
(SEQ ID NO:98) 


517-539 




16 


GTPNLAAQGTTRAPAVQAYPTMF 
(SEQ ID NO:99) 


540-562 




16 


GTPNIGVQGSTPAAQTYPLMF 
(SEQ ID NO: 100) 


563-583 




16 


GTPNLAAQGTTNIGARGTTPAAQAYPLMF 
(SEQEDNO.101) 


584-612 




16 


GTPNLAAQGTTTPAVQSYPTMF 
(SEQ ID NO: 102) 


613-624 




16 


GTPNLAGQSTTTTRAGQPYPTTF 
(SEQ ID NO: 103) 


625-647 




16 


AVPQAATAPAVQPYAMMF 
(SEQ ID NO: 104) 


648-665 




16 


GTPSLGAQDITPGGQAYPA 
(SEQ ID NO: 105) 


666-686 




15 


ATPNLAAYGTTPAVQAYPMMF 
(SEQ ID NO: 106) 


431-451 




15 


GIPNLAAQGTATPSVQAYPMIF 
(SEQ ID NO: 107) 


452-473 




15 


GEPNLAAQGTTATPAFQAYPMIF 
(SEQ ID NO: 108) 


474-496 




15 


GIPNVAAQGTTTTTPAAQAYPMMF 
(SEQ ID NO: 109) 


497-520 




15 


GIPNLAAQGTTTPAAQPYPTMF 
(SEQ ID NO: 110) 


521-542 




15 


GTPSLAAQGTTTAPAVQPYPTMY 
(SEQ ID NO: 111) 


543-565 




15 


GTPNFVAQGMTPAAQAYPVNG 
(SEQ ID NO: 112) 


566-586 



X indicates possible substitu 



insertion of one or multiple amino acids. 



ions of various amino acids and _ indicates site of possible 



Characterization of Gene 26 

[00151] The GenScan annotation indicated the presence of a putative open reading frame 
on the strand complementary to that in the Appendix (SEQ ID NO:87) that extends from 
nucleotides 173,669 to 167,079, with introns extending from nucleotides 173,613 to 171,029; 
170,894 to 169,686; and 167,581 to 167,198. This region is contained within the Bgl-5 
construct. The Gene 26 ORF is predicted to encode a protein of 804 amino acids (SEQ ID 
NO:51). To confirm the gene prediction, 5' and 3' RACE analyses were performed on RNA 
isolated from radish plants homozygous for either the fertile or sterile alleles of genes 
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throughout the entire Rfo region, including Gene 26. A single product was obtained from 
fertile allele plant RNA using both 5 ? and 3' RACE (Fig. 11); neither a 5' nor a 3' RACE 
product could be amplified from sterile allele plant RNA (not shown). 

[001521 The structure of gene 26 mRNA as deduced from the analysis of RT-PCR 
products is illustrated in Fig. 12. Sequence analysis of 5' RACE product indicated that the 5' 
of the Gene 26 mRNA mapped near nucleotide 171,000 and that the 3' end mapped at or near 
nucleotide 167,000. The mature mRNA lacks the second intron sequence indicating this 
sequence is removed by RNA splicing. The sequence of the third predicted intron was found 
within the mRNA. As a result, the mRNA encoded peptide is 707 instead of 804 amino acids 
in length. Fig. 13 depicts the polypeptide (SEQ IB~NO:88) encodedjiy the newly defined 
Gene 26 ORF (SEQ LDl^O:89). The N-terminus of Gene 26p is predicted to function as a 
mitochondrial targeting presequence with a likelihood of over 98% that Gene 26p is directed 
to the mitochondria. Gene 26 contains 15 repeats of a 35 amino acid consensus sequence, 
the pentatricopeptide or PPR motif VTYNSLISGYCKAGKLEEALELFKEMKEKGIKPDV 
(SEQ ID NO:90) (Small ID & Peters N (2000) Trends Biochem Sci 25: 46-47). The 
sequences of the repeated domain (PPR (pentatricopeptide repeats) and their locations in gene 
26 product are summarized in Table 3. 



Table 3 



Domain 
consensus 


Gene 


Domain sequences 


Location (Amino 
acid #) 


VTYNSLISG 
YCKAGKLEE 
ALELFKEMK 
EKGIKPDV 
(SEQ ID 
NO: 113) 


26 


YSFN1LIKCFCSCSKLPFALSTFGKITKLGLHPDV 
(SEQ ID NO: 11 4) 


115-149 


26 


VTFTTLLHGLCVEDRVSEALDFFHQMFETTCRPNV 
(SEQ ID NO: 115) 


150-184 


26 


WFITLMNGLCREGRIVEAVALLDRMMEDGLQPT 
(SEQ ID NO: 11 6) 


185-218 


26 


ITYGTIVDGMCKKGDTVSALNLLRKMEEVSHIIPNV 
(SEQ ID NO: 11 7) 


220-255 


26 


XOYSAIIPSLCKDGRHSDAQNLFTEMQEKGIFPD 
(SEQ ID NO: 1 1 8) 


-256-289 


26 


FTYNSMIVGFCSSGRWSDAEQLLQEMLERKISPDV 
(SEQ ID NO: 1 1 9) 


291-325 


26 


VTYNALINAFVKEGKFFEAEELYDEMLPRGIIPNT 
(SEQ ID NO: 120) 


326-360 


26 


ITYSSMIDGFCKQNRLDAAEHMFYLMATKGCSPN 
(SEQIDNO:121) 


361-394 

">. 


26 


rrFN'rLIDCiYCGAKRlDDGMELLHEMTKl'ULVADT 
(SEQ ID NO: 122) 


396-430 


26 


TYNTLfflGFYLVGDLNAALDLLQEMISSGLCPD 
(SEQ ID NO: 123) 


432-464 


26 


WCDTLLDGLCDNGKLKDALEMFKVMQKSKKDLDAS 

HPFNGVEPDV 

(SEQ ID NO: 124) 


466-511 


26 


TYNILISGLINEGKFLEAEELYEEMPHRGIVPDT 
(SEQ ID NO: 125) 


513-546 



51 



WO 03/006622 PCT/US02/22217 





26 


ITYSSMIDGLCKQSRLDEATQMFDSMGSKSFSPNV 
(SEQIDNO:126) 


547-581 




26 


VTFTTLINGYCKAGRVDDGLELFCEMGRRGIVAN 
(SEQIDNO:127) 


582-615 




26 


IT^TLICGFRXVGNINGALDIFQEMISSGVYPDT 
(SEQIDNO:128) 


617-651 



[00153J The PPR motif is found in other nuclear-encoded regulators of organelle gene 
expression in both fungi and plants (Fisk DG et al (1999) EMBO J 18: 2621-2630; Coffin JW 
et al (1997) Curr. Genet 32:273-280; Manthey GM and McEwen JE (1995) EMBO J. 14, 
4031-40) and PPR protein-encoding genes represent a large gene family in the Arabidopsis 
genome (Aubourg et al (2000) Plant Mol Biol 42: 603-613). The predicted mitochondrial 
localization of Gene 26p and the presence ofmultiple PPR domains within it are consistent 
with its role in regulating the expression of the orfl38 ORF associated with Ogura CMS. 
[00154] The above example provides evidence that multiple genes within the sequence 
listed in the Appendix as SEQ ID NO:87 can function as fertility restorer genes. These 
experiments, however, do not exclude the possibility that other genes within the region can 
also function as restorer genes. For example, two partially fertile plants were obtained after 
transformation with the Gene 15 (SEQ ID NO:30) construct. Thus Gene 15, like Gene 16, 
may be able to function as a restorer gene. Similarly, two partially fertile plants were 
recovered after transformation with the Gene 17 (SEQ ID NO:34) construct, although in this 
case, the restoring factor may have been derived from expression of the portion of Gene 16 
included in the Gene 1 7 transformation construct. A plant transformed with a construct 
containing Gene 21 (SEQ ID NO:42) and Gene 22 (SEQ ID NO:44) also produced some 
fertile flowers. Gene 24 (SEQ ID NO:48), Gene 26 (SEQ ID NO:52) and Gene 27 (SEQ ID 
NO:54) also contain PPR domains and are therefore expected to restore fertility phenotype 
according to the present invention. As discussed above, the preliminary analysis indicated 
that the restorer functions are most preferably located between Gene 14 and Gene 30 within 
the Rfo region as shown in the Appendix. 

[00155] There are several reasons why not all of the transformants recovered following 
transformation with a specific construct necessarily show the same phenotype. The 
expression of the genes encoded in a specific construct may vary depending on the site of 
insertion, the number of copies of the gene at each insertion site, and other factors such as 
transgene silencing. Therefore, it is apparent that other genes in the genetically defined Rfo 
containing region can likely be found in view of the present disclosure to function as genes 
that confer complete fertility restoration. Given that Rfo maps as a single genetic locus in 
radish, the invention provides that two or more different genes in the region are able to 
function as restorer genes for Ogura CMS in B. napus. 
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SEQUENCE LISTING APPENDIX 



Location /Qualifiers 
1 . .271256 

join (95. .12 92,1600. .1619, 1623. .1628) 
/gene=" Pept ide__l M 

/note="GenScan; Pl=0.650, P2=0.068, P3 = PlyA" 
predictedpeptide 1 | 405_aa 

MCKPNVVTFTTLMNGLCREGRWEAV7VLLDRWEDGLQPNQITYGTIVDGMCK>1GDTVSA 
LNLLRKMEELSHIKPDWIYSAI IDGLWKDGRHTDAQNLFIEMQDKGIFPDIVTYSCMIN 
GFCSSGKWSEAQRLLQEMLVRKISPDWTFSGLINALVKEGDLNSAQDLLQEMISSGVCP 
NVVTCNTLLDGLCDRGKLKDALEMFKAMQKSMMDIDATHAFNGVEPDVQTYNILISGLIN 
EGKFLEAEELYEEMPHRGIVPDTVTYSSMINGLCKQSRLDEATQMFDSMGSKSFSPNIVT 
FNTLI"TGYCKAG^fVT)DGLELFCEMGRRG^VANAITYITLIRGFRKVGNINGSLDIFQE^lI 
S S GVY PDTITI RNMLTGLWS KEELKRALAMLEELQMSMVS FMRLL 
(SEQ ID NO:l) 



CDS predict ed_CDS_l | 1218_bp 

atgtgtaaaccaaatgtcgtaaccttcaccacgctgatgaacggtctttgccgcgagggt 
agagttgtcgaggccgtagctctgcttgatcggatggtagaagatggtctccagcctaac 
cagattacttacggaacaattgtagatgggatgtgtaagatgggagacactgtgtctgca 
ttgaatcttctgaggaagatggaggagttgagccacat caaaccggatgtggtaatctat 
agtgccatcattgatggcctttggaaagacggacgtcataccgatgctcaaaatcttttc 
attgaaatgcaagacaagggaatctttccagatatagttacctacagctgtatgattaat 
ggattttgtagctctggtaaatggagtgaagcccagcgcttgttgcaagaaatgttagta 
aggaagatcagccctgatgttgtaactttcagtggattgat caatgcattggtcaaagag 
ggcgatcttaattctgctcaagaccttttacaggagatgat ttctagtggtgtgtgccct 
aatgtcgttacttgtaacactttgctggacggtctctgcgatcgcgggaaactaaaagat 
gcattggaaatgtttaaggctatgcagaagagtatgatggacat tgatgctactcatgcc 
ttcaatggtgtggaacctgatgttcaaacttacaatatattgatcagcggattgattaat 
gaagggaagtttttagaggccgaggaattatacgaggagatgccccacaggggtatagtc 
ccagatactgttacctatagctcaatgatcaatggattatgcaagcagagtcgcctagat 
gaggctacacaaatgtttgattcgatgggtagcaagagcttctctccaaacatagtgaca 
tttaacacactcattactggctactgtaaggcaggaatggttgatgacgggctggagctt 
ttctgcgagatgggtcgaagagggatagttgctaatgcaattacttacatcactttgatt 
cgtggttttcgtaaagtgggtaatattaatgggtctctagacattttccaggagatgatt 
tcaagtggtgtgtatcctgataccattactatccgcaatatgctgactggtttatggagt 
aaagaggaactaaaaagggcactggcaatgcttgaggaactgcagatgagtatggtatcc 
tttatgcgtctattgtag 

(SEQ ID NO: 2) 

gene complement (join (1810. .1815,2321. .272 2,2845. .2884)) 

/gene="Peptide_J2 " 

/note="GenScan; Pl=Prom, P2=0.588, P3=PlyA" 
predict ed_peptide_2 | 133_aa 

MISSLTHMGADKAGRNQYEEEEKEEEDGIRAITLSGSNLGATMKTELDDNHGDSYKNGGQ 
ELDFLTTFVNSNFQAVNNSIMMGAKYETHDPGVHLDISGDVEKPLMKAPGRGWRERKGKT 
QARRDRRESEHTD 
(SEQ ID NO: 3) 



FEATURES 

source 
gene 



CDS predicted_CDS_2 | 402_bp 

atgatctcctccttaacccacatgggagcagacaaggcaggacggaaccaatatgaagaa 
gaggagaaggaggaggaggatggcatcagagccatcacgctctctggatccaaccttgga 
gccaccatgaagaccgagcttgatgataaccatggagacagctacaaaaacggtgggcaa 
gagcttgatttcctgaccacttttgtaaacagcaactttcaagctgtgaacaactccata 
atgatgggcgcaaagtacgagactcatgatccaggagttcatcttgatatctcaggcgat 
gtggagaaacctttgatgaaggcacctgggagggggtggagggaaaggaaagggaaaact 
caagctagaagagatcgtcgagaatctgaacatacagattga 
(SEQ ID NO: 4) 
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gene join {3108 . .314 7, 6085. .6246,6324. .6479,6565. .7151, 

77 34 . . 7 929, 8893 . . 9246, 10434 . . 10642 , 11676. . 11921 , 
11960 . . 12011 , 12023 . . 12095, 14659 . . 14749, 15776 . .16004, 
16104 . . 16109) 
/gene= " Pet ide_3 " 

/note="GenScan; Pl=Prom, P2=0.998, P3=0.958, P4=0.987, 
P5=0.521, P6=0.042, P7=0.070, P8=0.410, P9=0.456, 
P10=0.597, Pll=0.652, P12=0.882, P13=PlyA" 
predicted_peptide_3 | 784_aa 

MTSLFSKAI^ALTSLGSYFKTYRPVTGKIjHVATLTIjLLFLIjAAAVAVTSSLWLSKTTKQFD 
TPTLVTRKPVPELESPKKTGVLVNGTSLLNQNRSGSYSETSLWLNKTKSYNQPTIITTKP 
VHVPVPVPEKKSTKKTGISVDCTSFLNQNRSGSCSRTAQPGYNIsrNQTESNRSCPDYFKWI 
HEDLKPWRETG ITREMVERGQTTAHFRLVI VNGKVFVENYKKS I QTRDEFTLWG I LQLLR 

- KYPGKLPDVDLMFDCDDRPVIRIJDGYSKSIWTAENAPPALFRYGGDRWTADWFPDWSFW 

GWQEINIKPWSKVLTEMEKGKKKKKFMEREAYAYWKGNPFVASPSREDLLTCNVSSQHDW 
NARI FIQAQE I GRGAS EFMQRDLSMENVY D YMFHLLNE YSKLLKFKPQVPQNS VE I CTEA 

IWCPSGDANDTNKRFLMGSLVDEPHNSGPCSliPPPFDPNGLEKFYRKKLNIilRQVEKWEH 
AYWENHRSRSNIGLTFYRTRRTKLERIRKYRFRSESMLiKYLLDLFYGDPQVSWIYVLQE 
TQSGI RNTQKLLYI RKKYKTTL YMRFKHRSRSNI GLTF YRTRRTKLERI RKYRFRS ESML 
KYLLDLFYGDPQVSVWIYVLQETQSGIRNTQKLLYIRYFFKFQVKVFRYNFRFRISKKYN 
LGVRVKFEIPFGFSDLILGLKRFETIYSGLYDCFETLSTAINRKNCVCSQVAERRLTRRR 

IQPPSSSLSPLHEACSPSSRSSSPSILSDNRSSRVQSRRRQRRSRRCQRQRQRRIAKSNS 
TSEL 

(SEQ ID NO: 5) 

CDS predicted_CDS_3 | 2355_bp 

atgacatctctcttctccaaagcccgagctctcacttctctgggctcctactttaaaacg 

taccgtccggtcaccggaaaactccatgtggccaccttgacgcttctcctcttccttctc 

gcagccgctgttgccgtcacctcttctctatggcttagtaagacgacgaaacaatttgat 

acaccgacattagtcacaagaaaaccggtacccgagctagaatcaccgaagaaaaccgga 

gtactggtaaatggcacaagtttattgaatcaaaaccggtctggttcttactcagaaaca 

tctctatggcttaacaagacaaaatcatataatcagccgacaataataacaacaaaaccg 

gttcacgtaccagtaccagtaccagagaagaaatcaacgaagaaaaccggaatctcggta 

gattgcacaagtttcttgaaccaaaaccggtccggttcttgctcgagaacagctcaaccc 

ggctataataataaccaaaccgaatcgaaccggtcatgtcctgattacttcaagtggatc 

cacgaggatctaaagccatggagagagacggggataacgagagaaatggtggagagagga 

caaacgacagcgcatttcaggttagttatagtaaacggcaaagtgttcgtcgaaaactac 

aagaagtctatacagactagagacgagttcacactgtgggggattcttcagctgctgaga 

aagtatccagggaagttgcctgacgtggatctcatgttcgactgtgatgatcggcctgtt 

attagattggacggttacagtaaatctaatcatacagctgaaaatgcaccacctgcgtta 

tttagatacggcggagatagatggacggcggatgtcgtctttccagactggtcattctgg 

ggatggcaagagattaacataaagccatggagcaaagtgttgacagaaatggaaaaagga 

aagaagaagaagaaatttatggagagagaagcttatgcatattggaaagggaaccctttt 

gttgcatxiJ:.ccttcgagagaagatcttcttacttgcaatgtatcctcacaacatgattgg 

aatgctagaattttcattcaggctcaagagattggaaggggagcaagtgagttcatgcaa 

cgagatctatcaatggaaaacgtgtatgattacatgttccatttgttgaatgaatactca 

aagcttcttaagttcaagcctcaagttccccaaaacagtgttgaaatctgcacagaagca 

atggtgtgcccttctggagatgctaatgatactaataagagatttttgatgggctcttta 

gtcgatgagcctcacaattcaggcccatgttcgctacctcctccttttgatcccaacggt 

ctcgagaagttttataggaagaaactgaatctcatccggcaagttgagaaatgggagcac 

gcttactgggaaaaccacagatcaagatctaatataggtctaacattttaccggacccga 

agaaccaaactagaacggatccgaaaatacaggttcagatccgagtctatgctaaaafeat 

ttattggatcttttttatggggacccacaagtttcggtttggatctacgtcctacaagag 

acccaatcgggtattcgaaatactcaaaaattattatatattagaaaaaaatacaaaaca 

acactatatatgcgcttcaagcacagatcaagatctaatataggtctaacattttaccgg 

acccgaagaaccaaactagaacggatccgaaaatacaggttcagatccgagtctatgcta 

aaatatttattggatcttttttatggggacccacaagttt'cggtttggatctacgtccta 

caagagacccaatcgggtattcgaaatactcaaaaattattatatattagatattttttt 

aagtttcaggttaaggtttttaggtataatttcaggtttcggatttctaaaaaatataat 

ttgggtgttcgggtaaaatttgagatacctttcgggttttcggatctgattttgggtttg 
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aagcggtttgaaacgatttatagcggtttatatgattgttttgaaacattgtcaaccgct 
atcaaccgcaaaaattgcgtttgcagtcaagttgcagagaggaggcttactcgtcgtcgc 
atccagccaccttcgtcgtcattgtctcctctccacgaggcttgctcgccgtcttcccgg 
tcgtctt ctccaagtat r cttgagtgacaatcgcagcagcagagttcagagtcgaaggcgc 
cagaggcgaagtcgaaggtgccagaggcagagacaaaggcgcattgctaaatcaaactca 
acatccgaactttaa 
(SEQ ID NO: 6) 

complement (join (1622 3 . . 1622 8 ,17 060. .17321, 18 349. .183 67, 
2 0 705. .21068,21165. .22997,23102. .23701,23876. .23915) ) 
/gene= " Pept ide_4 " 

/note="GenScan; Pl=Prom, P2=0.681, P3=0.524, P4-0.727, 
P5=0.287, P6=0.668, P7=PlyA" 
predicted_pept ide_4 1 1025__aa 

MLTNELDKAYEEEEKYWRQRSRILWLQHGDHNSSYFHAITRSRKAANKFSVIiEKQDGTAV 
FAEEQ I AAS IEEYYDTLFTSASQGKSQI I EDAI S PRI S PEMNATLTS I PDDLEI KRAVFA 
IHRDKAPGPDGFSASFYQGFWDIIGEEVCREVRDFFISGQLHRRFNETHVRLIPKIKTPK 
TVTDYRP I ALCSTHYKVI AKI LH YLCTS SAKKHCTMAVKTDMSKAYDRLEWS FLRDVM VR 
FGFHPTWVHWIMECVSSVSYSYLINGGAHGCVVPSRGIRQGDPLSPYLFILCSEVLSGLC 
SKAFEQGKIjCGI KVARNSPPLNHLLFADDTMFFCKSDPI SCKELKNI FEKYEVASGQS IN 
CliKSAITFSAKTPIETRRRVKAELNIVGEGGIGKYIiGLPEHFNRKKRDIFASITDRIRQK 
SHSWTSRYIiNGAGKLVLLKSILSAMPTYAMTCFKLPKSLCKQIQTVLTRFWWDDKPDHRK 
MSWAWSTLTLPKRAGGLGCREIEKFNDALLAKLAWRILKFPESIiLAQTLAGKYCHSTPF 
LSTPAPKSASHGWRGVIAGREVLRQGLGWVIGNGSDINAWSDPWLSPKTPMCPMGPPTEQ 
NKELKVSDLLNGITKEWDLPAIRLHLPQYEEHILKIaVPSEFHMKDELCWLHTRSGEYSTK 
TG Y P Y W KTNRG E ELTD FNWNLC I WQ I RTS PKL KH FLWK I KS KDL» P VG ANLLH RG I Q VEGR 
CKRCGLI ETERHVFLQCPFARRVWDLVPVMFKPDPAI ITSPEALLQTSRRI VNLPPLGLG 
ETDLiYPWI FWYLW I GRNML I FENREGS EQ ELARQVS NLDVE AQC FVDAAWNAGTSGGGFG 
CIFKDMSNKTFHRSSSNRSIVGSALIAEAIxAVKAGLKAARSLGLRKLVIRSDSKSLIMAI 
NTKEKIVEAQGVLFDIDHLCTLFNSVSFEFVSHDSPRLNHNNLNADPSRWIHFTQLADPR 
LYLHHEPPPNSINHHAYRREGQDPPKGQKENEIEKRTLQEKCVLIADQDSVFSLSAMVDI 
SVSFR 

(SEQ ID NO: 7) 

predict ed_CDS_4 | 3 078_bp 

atgttgacaaacgagcttgacaaagcctacgaggaggaggagaagtattggcgtcaacgt 
agtagaatcctctggctgcaacacggagaccataattccagct act tt cat gccat tact 
agaagccgcaaggctgccaataaattctctgttcttgagaaacaagatggtactgctgtg 
tttgcagaagaacagatagcagcctctattgaagagtactatgataccctgttcacatcg 
gcgtctcaagggaaatctcagatcatagaggatgctatctcacccagaatctcccctgaa 
atgaatgcaacactcacaagtataccagatgatttggagatcaagcgagcagtgttcgct 
attcatagagacaaagctccgggacccgacggtttttcggccagtttctatcagggattt 
tgggacatcataggagaagaagtctgtcgggaggtcagagatttctttatctcgggtcaa 
cttcatcgtcggtttaacgagactcacgtccggt taataccaaaaatcaaaacgcccaag 
acagtaacggattacaggcccatagcactctgctctacacactataaggtcatcgcgaag 
at ccttcattacctctgtacctcatcagcaaagaagcattgcaccatggccgtcaaaacg 
gacatgagtaaggcctatgatcgactggagtggagtttcttgagagatgtcatggtccga 
tttggcttccaccctacatgggtgcattggattatggaatgtgttagctcggtttcgtac 
tcatacctcatcaatgggggcgctcacggttgtgttgtaccgtcgcgtgggatccgacaa 
ggcgacccgctgtctccctatttatttattttgtgctctgaggtcttatcgggactctgc 
tctaaggcgtttgaacaaggaaagctgtgtggcatcaaggtagcgcgcaattcaccacct 
ctaaatcatctcctcttcgccgatgatacgatgttcttttgcaagagtgacccgatatct 
tgcaaagaactaaagaacatctttgagaaatatgaggttgcttctggccaaagcatcaat 
tgtctgaagtctgctataaccttctctgccaaaacacccattgaaacaagacgcagggtc 
aaagcggagcttaacattgtgggtgaggggggtataggtaaataccttggcctcccagag 
cacttcaacagaaagaagagagacatatttgctagcattactgatcggatcaggcaaaag 
tcgcatagctggacctctaggtacctcaatggcgcagggaagttggtcctcttgaagtct 
attctctctgctatgcccacctatgcgatgacgtgcttcaagttgcctaagtcactctgc 
aaacagatacaaactgtgctcactcgcttttggtgggacgataaaccagatcatcgtaaa 
atgtcttgggttgcttggagtactcttactctacctaagagagctggcggtt tgggatgt 
cgggagattgagaagtttaatgatgctctgttagcaaagttggcctggcgaatactgaag 
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tttccagagtccttgctggctcaaacactggcagggaagtactgtcactccaccccattc 

ctctcaactccagctcctaaatcggcttctcatggttggcgtggagttatagcgggcaga 

gaggtgttacgccaaggcctaggatgggttattggtaacggatcggacatcaacgcctgg 

tctgatccttggctctccccaaagactccaatgtgtccaatgggtccacctacagagcag 

aacaaagagctgaaagtatcggaccttt tgaatggcattacaaaggaatgggatctgcct 

gcgattagactacacttacctcagtatgaggaacacattttgaaactagttcctagcgag 

t t ccacatgaaggatgaactctgctggctccatacacgttctggagagtact caaccaag 

acgggctacccatactggaagacaaatcgtggtgaagagctgactgacttcaactggaac 

ctctgtatatggcaaattagaacttccccgaagctgaaacatttcctatggaagatcaaa 

agtaaagatttgccagttggtgcaaatcttttacacagagggattcaagtggaagggcgt 

tgtaagagatgtggattgatagaaacagagcgtcatgtttttctacaatgcccatttgcc 

cgtcgtgtttgggatctggtgcctgtcatgttcaaaccagacccagcgatcatcacctcc 

ccggaagctcttttgcaaacctcgcgacgcatagttaatctgccaccactgggccttggt 

gaaactgatctctatccttggatcttctggtacttgtggatagggagaaacatgttgatt 

"~~ tttgagaaccgagagggatcagagcaagaactggctcgtcaggtttctaacttggatgtg 

gaggctcaatgttttgtagatgcagcgtggaatgctggcaccagtggtggaggatttggg 

tgcatcttcaaagatatgagcaacaagaccttccatcgctcttcatcaaaccgcagcatc 

gtcggctctgctctaattgcagaagctcttgcggttaaagctggcttaaaagcggcacgg 

tctctgggtttgcgtaagctggtcattcgatcagattccaagtcccttatcatggccatt 

aacaccaaagagaagattgttgaagcacaaggagttctttttgatattgatcacctttgt 

actctttttaattctgtttcgtttgagtttgtttctcacgactctccacgactaaaccac 

aacaacctaaatgctgacccatcccggtggatccacttcacacaacttgctgatccacgg 

ctatatctacatcatgaacctccaccaaactccataaatcaccatgcatatcgtcgggaa 

ggccaagatccaccaaaagggcaaaaggaaaatgaaatagaaaagaggacactacaagaa 

aaatgcgttttaatagcagaccaggatagcgttttttcgctttctgctatggttgacata 
tccgttagttttagatag 

(SEQ ID NO: 8) 

gene join (24 908. .24947,26266. .262 86,27104. .27150,273 32. .2 7454, 

2 7545. .2 7609,2 7695, .27952,28 046. .28251,28 776. .28940, 
29079. .29165,29713. -298 05,29894. .29911,3 0085. .30090) 
/ gene= " Pept ide_5 " 

/note="GenScan; Pl=Prom, P2=*O.00O, P3=0.000, P4=0.845, 
P5=0.976, P6=0.993, P7=0.996, P8=0.635, P9=0.883, 
P10=0.490, Pll=0.460, P12=PlyA" 
predict ed_peptide_5 | 3 60_aa 

MPSPLTKKPSVQWFNGKAAFRDWFSSNSSANTAGLCKSSQLKVADFSYNFFVGNIPKCLE 
HFPRTSFQGNCMQNKDLKHRPSSQCGNAQLVKTHESPSSPPKHQSAEIVAKHHKTSRPKW 
LLALEI VTGSMVALFCLVALFSAVHRWQNRPSLI I PWKKS S SLKEKFAVYVDS EMLKD VS 
RFTRQELEVACEDFSNI IGLSADSQVYKGTMKGGPEI AVI SLCVKEEDWTGYLELYFQRE 
LVDFECWKTIIoARSEK^LRNINSEGAVCVLPNAMESRYLDVSGNIYAFGVLLLEIAKEYL 
ETPEAMTSLVDPELKHFNQEELETSFDYFKKPPI S WLVRIil CSHDVECHQSS DRQHATVA 
(SEQ ID NO: 9) 

CDS predicted_CDS_5 | 1083_bp 

atgccttcgcctctaacgaagaaaccttcagtccaatggtttaacgggaaagctgccttc 

agagattggttttcttcaaattcaagtgctaatactgctggtctgtgcaagtcttctcag 

ttgaaagtagctgacttttcatacaacttctttgtgggaaacattccaaagtgtttggag 

cactttccaaggacgagct'ttcaagggaactgcatgcaaaacaaggatcttaagcacaga 

ccatcttcccaatgcggtaacgcacagctggtcaaaactcatgaaagtcccagctcccca 

ccaaagcaccagtcagctgaaattgtggctaagcatcataaaacatcaagacctaagtgg 

cttcttgcgcttgagatagtcacaggatcaatggtcgctttgttctgcctggttgcactt 

ttctcagcagttcaccgctggcaaaacaggccttctctcatcattccttggaagaaatct 

tcaagcctaaaggaaaagttcgcagtctacgttgattctgaaatgctcaaggatgtctcg 

agattcacaaggcaagagcttgaggtggcgtgtgaagacttcagcaacatcattggttta 

tctgctgatagtcaggtctataaaggaacaatgaaaggtggacctgagattgctgtgatc 

tctctttgcgtcaaagaagaagattggaccggatacctcgagctctatttccagagagag 

ctggttgattttgaatgctggaagacgattcttgcgagatcagagaagaatttgagaaat 

attaacagtgaaggtgcggtatgcgtgctcccaaacgcaatggagagtcgatatctggat 

gtgtctggtaatatatatgcatttggtgttcttttgctggagattgcaaaggagtatctt 

gaaacaccagaggcaatgacgagtttggtggatccggagctgaagcattttaaccaagaa 
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gage ttgaaacgtcttt tga eta ttttaaaaagccgccaatttcatggctggtacgactt 
atctgttctcacgacgttgagtgccatcaaagctcagaccgccagcacgcaacggttgca 
tag 

(SEQ ID NO: 10) 

gene join {30646 . .30685, 3 08 04. .31118, 3 13 22. . 3 2 725, 3 2773 . .32812 , 

32856. .33 067, 3 3172. .33294,3 33 83 . .33862,33935. .34258, 
34716. .34 873,3 54 78.. 3 553 9, 35 64 0 . .3 5758 , 3 5 824 . . 359 91 , 
3 60 91 . . 3 6123 , 3 6192 . .36197) 
/gene= " Pept ide_6 " 

/note="GenScan; Pl-Prom, P2=0.942, P3=0.995, P4=0.3i0, 
P5=0.588, P6=0.839, P7=0.999, P8=0.136, P9=0.426, 
P10=0.630, Pll=0.972, P12=0.999, P13=0.999, P14=PlyA" 

predict ed_jpept ide_6 | 1145__aa 

MATKLLSLTCIRKERFSGRYPPVXjRKHLNSRPRGGGDGSPSETAVLKIDEEEAVSDAVFR 

VTGMTCSACAGSIEKEIKRLPGIHEAVIDALNNRAQI-<^FYPTSVDVETIRETIEDAGFEA 

S L I ENEANERS RQ VCR I R I NGMTCTS CS SSI ES VLQS L YG VQRAHVALA I EE AEVH YD PT 

LLSYDKLLEEIDNAGFEAILISTGEDVSKIDLKIDGEFTDEAMTMIERSLEALPGVQSVE 

I SHGSDKI SVL YKPDVTGPRNFI RVI ESTVFGHSGHI KATVFSEGG VGRESQKQEE I KQY 

YKS FLWS L VFTVPVFLTAMVFM YI PG I KHLLMFKVINMLT VGE 1 1 RWVLATPVQF I IGWR 

FYVGSYKALRRGSANMDVLIALGTNAAYFYSLYTVIjRAATSPDFKGVDFFETSAMLISFI 

ILGK^LEIMAKGKTSQAIAKLMNliAPDTAILLTVDEEGNVTGEEEIDGRLIQK^ 

PGAJCVASDGYVIWGQSHVNESMITGEARPVAKRKGDTVIGGTLNENGVLHIKVTRVGSES 

ALAQIVRLVESAQLAXAPVQKLADRISKFFVPIAQLL^ 

SMDSFELALQFGISVIWIACPCAIjGLATPTAVMVGTGVGASQGVIjIKGGQALERAHK^C 

IVFDKTGTLTMGKPVWKTKLLKNIWLREFYELVAATEVNSEHPLAKA 

ENPTWPEARDFVSITGTGVRATVKGREIMVGNKSLMSGYKVTITADAEELLAEAEEMAQT 

GI LVS IDNELTGVLAVSDPVKPSAREAI S ILKSMNI KS I MVTGDNWGTANS I AREVGIDS 

VIAEAKPEQKAEKIKELQAAGHVVAMVGDGVNDSPALVAADVGMAIGAGTDIAIEAADIV 

LMKSNLEDVITAIDLSRKTFSRIIUiNYVWAIiGYNLMGIPlAAGVLFPSTRFRLPPWIAGA ' 

AMAAS SMLRNG PRF VSLLVS FGC I LFEDDALVLR I RRNKAE 1 1 SNRNGPS LKATRI NS AG 

MTNSlSnTTELNELYTSIKTKDKGLEILAFPCNQFGEEEPGTTDQITEFVCTRFKSEFPIFNKI 

EWGENASPLYKFLKKGKWGIFGDEIQWNFAKFLVDKNGQAVERYYPTTSPLTLEHDIKK 

L.LNLS 

(SEQ ID NO: 11) 

CDS predict ed_CDS_6 | 3 43 8_bp 

atggccacgaagctattgtcgctcacatgcatacgcaaagagagattcagcggccgttac 
cctcctgtgctgcggaaacatctaaacagcaggccacgcggcggcggagatggatcaccg 
tcggagacggcggtgcttaagatcgacgaggaggaggcggtctccgatgcggttttccgc 
gtcacggggatgacatgctccgcgtgcgctggatctatcgagaaagagatcaagcgtctt 
cccgggattcacgaagcggtcatcgacgctctcaacaaccgggctcaaatccagttctac 
cctacctctgtcgatgtggagacgattcgtgagactattgaagatgctggatttgaagca 
— tcactgatagaaaacgaggcgaatgagaggtccagacaagtatgcaggataagaataaac 
ggtatgacttgtacctcatgttcttcatccattgaaagcgtactgcaatcactttacggt 
gtacaaagagctcatgttgccttagcaattgaagaagctgaagttcattatgatcccacg 
ctcctgagctacgataaactactggaagagatagataatgctggatttgaagcgattctt 
ataagcacaggcgaagatgtgagcaagattgatttgaagatcgatggcgagtttactgat 
gaagcaatgacgatgattgaaagatcgcttgaagcacttcctggggttcagagtgttgag 
atcagccatggaagtgataagatatctgtgttgtacaaacctgatgtgacggggccgagg 
aacttcattcgtgtgatagagtctactgtctttggtcatagtggtcacatcaaggcaaca 
gtattctccgagggaggggtgggcagagagtctcaaaagcaagaagagatcaagcagtac 
tataagtcgtttctctggagtttggtttttacggtaccagtgtttttgacagccatgg.tc 
tttatgtatatccctggaattaaacatttgctaatgtttaaggtcatcaatatgctcacc 
gttggagaaatcataaggtgggttttggctacacctgtacagtttatcatcggctggaga ' 
ttctatgttggctcttacaaggctttacgccgaggatcagctaacatggatgttctgatt 
gctctgggaacaaatgcagcttatttctattcgttatacacagtgttgagagctgcaaca 
tctcctgatttcaagggagtagatttctttgagactagtgccatgctcatttcgtttatc 
atactaggaaagtatctggagataatggcaaaaggcaaaacatctcaagcgatcgcaaag 
cttatgaacttggcaccagacactgcgatattgttgactgtggacgaggaagggaatgtg 
actggtgaagaagagattgatggccgattgatacagaagaacgacgtgatcaagatcgtt 
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cctggtgctaaagtagcttccgatggttatgttatatggggacaaagtcatgtgaatgaa 

agtatgataactggagaggcaaggccagtggcaaagagaaagggtgatactgttatagga 

ggcacactgaacgagaacggtgttctgcatattaaggtgacaagggttggttcagagagt 

gctcttgcacagattgttcgacttgttgaatccgcccagctagccaaagctccagttcag 

aagttggctgatcggatatccaagttctttgttcctctggcacaactacttcttaaactt 

acgaattataatattgcagctggaaaactgcattggtaccctgaatcatggataccttct 

tcaatggatagctttgagctagctcttcagtttggaatctctgtcatggtcatagcttgt 

ccatgtgctcttgggctggctactccaaccgctgttatggttggaactggggttggtgca 

tcccaaggtgtgctgataaagggtggtcaagctctagaaagagcacacaaggtaaattgc 

attgtatttgacaagacaggaactctcacgatggggaaacccgttgttgttaaaaccaaa 

ctcctgaaaaacatggtacttcgagaattctatgaacttgtggctgcaactgaggtaaac 

agcgagcatccattggcaaaggccattgttgaatatgcaaagaaattcagagatgacgaa 

gagaaccctacgtggcctgaagcccgtgattttgtgtctatcactggaaccggagtgaga 

gcgactgt taaaggaagagagattatggtgggaaacaagagcctcatgtctggttacaaa 

gttactattacagccgatgctgaggagttgctagcagaagctgaagagatggcccagaca 

ggaattctcgtgtctatagacaatgagttaaetggagttctagctgtttcggatcctgta 

aaaccgagtgctcgagaagccatctcaattctaaaatccatgaatatcaaaagcatcatg 

gtaacaggtgacaactggggaactgcaaactccattgctagagaagtcggtatcgactct 

gttatcgcagaagctaaacccgagcagaaagcagagaaaatcaaggaactacaggctgcg 

ggtcatgttgtggcgatggttggagacggagtcaatgactcacctgctctcgtggcagcg 

gatgtaggaatggccattggtgcaggaaccgacattgcaatagaagcagctgatattgtc 

ctgatgaaaagcaacttggaagatgtgatcacagccattgatctatcgaggaaaacgttc 

tcaaggatccgtctcaactacgtatgggctctcgggtataacctcatgggcataccgatc 

gctgcgggggtgcttttcccgtctactcggttcaggttgcctccgtggattgcaggtgct 

gcaatggctgcttcttctatgctgagaaatggtccgagatttgtttctttactcgtgtcc 

tttggatgcattttatttgaagatgatgctttagttttacgcatcaggcgcaataaagct 

gaaataataagcaaccgcaacggcccatcattaaaagccactcgcattaacagcgctggg 

atgacgaactctaactacactgaattgaatgagctttacaacaagtacaaagaeaaaggt 

ctggagattctagcatttccttgtaaccaatttggtgaagaggaacctggaactactgac 

caaattacagagtttgtctgtacccgcttcaaatctgagttccccattttcaacaagatt 

gaagtgaacggagagaatgcttcccctctttataagttcttgaaaaaaggcaaatgggga 

a t ct t cggggacgagat tcagtggaact t tgccaagt 1 1 ct tgt tgacaagaatggt caa 

gctgttgaacgttactacccaactacttctcctcttacactagagcatgacatcaagaag 
cttctgaacctctcatga 

(SEQ ID NO: 12) 

gene complement (join(36226. .36231,36662. .36706,36908. .37048, 

37163. .37321,37422. .37553,3 7658. .37725,3 8166. .38634, 
39144. .39183) ) 
/gene= " Pept i de_7 " 

/note="GenScan; Pl=Prom, P2=0.859, P3=0.993, P4*0.976, 
P5=0.861, P6=0.821, P7=0.640, P8=PlyA" 
predicted_peptide_7 j 337_aa 

MSFPHYYLQRGAFTNLSPSQVASGLHAPPPPGMRPMPNPNVHHPQANNPGPHFSMADHRH 
SDFGHNIHMGMASSASVQPLQPPQMEQPLVKKKRGRPRKYAPPDGQVSLGLSPVPCSSGG 
SSKAAKDSSAVTDPNAPKRARGRPPGTGRKQRLANLGEWMNTSAGLAFAPHVISVGAGED 
IVSKVMSFSQQRPRAIiCIMSGTGTVSSVTLRQPATTESSLTYEGRFEILSLGGSYLVNEE 
GGSKSRTGGLSVSLSGPEGHVIGGGIGMLIAASLVQWACSFIYGGVNNSNKTIKQENKP 
KEEHKDSEMETNPQEEEEEEEAQLLKQQQQQQHTCRM 
(SEQ ID NO: 13) 

CDS predicted_CDS_7 1 1014_bp 

atgtcatttcctcactactacctccaaagaggagccttcaccaatctctccccttcccag 
gtcgcgagtggtctccacgcgccgccgccgccgggaatgagacctatgccgaaccctaac 
gttcatcatcctcaagctaacaacccgggtcctcatttctccatggccgaccacagacac 
tctgatttcggacacaacattcacatggggatggcttcctctgcttctgtgcagccgctg 
cagccgccacagatggagcagccgttggttaagaagaagcgtggacggccgaggaagtac 
gctcctcctgatggacaagtctctttagggctttctcctgtgccttgttcttctggtggt 
agtagtaaagcagctaaggactcttctgcagtgactgatccaaatgctcctaaacgagcc 
agaggtcgacctcctggtactggaaggaagcaacgtttagctaatcttggtgagtggatg 
aacacttcagctggacttgcttttgcacctcatgtcatcagtgttggagcaggagaagat 
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atcgtttcaaaagt catgtcattctcacagcaaagacctcgagctctttgtataatgtca 
ggcaccggaacagtttcttcagtcactctacgtcaacccgctacaacagagtcttctttg 
acatatgagggacgttttgagatactaagtctagggggatcttat ttggtgaatgaagaa 
ggtggatccaaaagccgaacaggcggtttgagtgtctccctttctggtcctgaaggtcat 
gttattggcggtgggattggaatgcttattgcagccagcctcgttcaggttgtggcttgt 
agttttatatacggaggagttaataacagtaacaagactatcaaacaagaaaataaacca 
aaggaagagcacaaggatagcgaaatggagactaatccacaagaagaagaagaagaagaa 
gaagcacagctattgaagcagcagcagcagcagcagcatacgtgcagaatgtga 
(SEQ ID NO: 14) 

gene join (404 84. .40523,40553 . .40692,412 04. .41375,4 3081. .4 3086, 

43211 . .43216) 
/gene = " Pept ide_8 " 

/note="GenScan; Pl==Prom, P2=0.955, P3=0.388, P4=0.167, 

P5=PlyAi'— — 

predicted__pept ide_8 1 105_aa 

MGIRGPNRVSVLSIQISVFRVYQNQPYSDYLKVRFGTGSGFIGFGSGVL.NVSGSIGYPFS 
^ FGFGTDNTHNPKYHETRSIRYLCRVRIGSDSFLSDRVRFEFSDSN 
(SEQ ID NO: 15) 

CDS predicted_CDS__8 | 318_bp 

atgggtattcggggtcctaatcgggtttcggttttatccattcagatttcggtttttcgg 
gtttatcaaaaccagccctattcggattatttgaaagttcggttcgggaccggttcgggt 
tttatcgggttcgggtcgggggttttgaatgtttcgggttctatcggatatccatttagt 
ttcgggt tcggtacggataatacccataacccgaaataccatgaaacaagatccatt egg 
tatttatgtcgggttcggatcggttcggattcatttttatcggatcgggttcggttcgaa 
ttttcggattcgaattaa 
(SEQ ID NO: 16) 

gene complement (join (4 3579 . . 43 584 , 44345 . . 44 516 , 44 776 . . 44966, 

45056. .45151,45243. .45341,45418. .4 5510,4 5622. .45 957, 
46034. .46247,46267. .46398,46486. .46544,46767. .46843, 
4 6 952 . .47162,47257 . .4 7373, 47455 . .4 7541, 47 903 . .4 7998 , 
4 810 2 . . 48167, 4 8264 . . 4 8446 , 48 53 7 . .48686, 48982 . .4 9164, 

4 9324 . .49410,49529. .49690,49956. .50092,50180. .50237, 

5 0366 . . 50488, 50659 . . 50716, 508 74 . . 52171, 523 60 . .523 99) ) 
/gene=" Peptide_9" 

/note='»GenScan; Pl=Prom, P2=0.869, P3 = 0.998, P4 = 0.867, 
P5=0.889, P6=0.935, P7=0.741, P8=0.896, P9=0.998, 
P10=0.617, Pll=0.999, P12=0.999, P13=0.994, P14=0.758, 
P15=0.999, P16=0.993, P17=0.879, P18=0.954, P19=0.893, 
P20=0.891, P21=0.906, P22=0.667, P23=0.951, P24=0.999, 
P25=0.984, P26=0.958, P27=PlyA" 
predict ed_peptide_9 | 14 94_aa 

MGKGRAKAVEKRVLDQKIjRGS VNVPS GPVYYPTEDEFKS PLDYI HKI KPEAEAYG I CKI V 
PPKSWKPPFGLDLESVRFPTKTQEIHRLQFRPASCNSKTFQLEYGRFVEERLGKKVKKRV 
VFEGGDLDLCKVFNAVKKFGGYDKWKGKXWGEVYQFMSSGEK^ 
HEFEKYHGMMSADPSAKGHKRNRRCSEFSSSKRRKRNNGE 

SDJOJGEVMLLCDSCNKGWHIYCLSPPLKHIPPGNWYCLDCLNTDEDTFGFVPGKCLLLED 
FKRIADRAKRKWFGSGPVSRTQIEKKFWEIVEGSGGDVEVMYGNDLDTSVYGSGFPRIGD 
RRPESVEANVWDEYCHS PWNLNNMPKLKGSMLQAI RHNINGVTVPWLYLGMLFSAFCWHF 
EDHCFYSVNYHHWGEAKCWYGVPGSAASAFEKVMRKALPDLFDAQPDLLFQLVTMLSPAV 
LQENKVPVYTVL.QEPGNFVITFPKSFHAGFNFGLNCAEAVNFATADWLPYGGYGAELYRL 
YRKPAVI SHEELLC WAKGNSCDGKGS I YLKKELLRI YSKEKTLREQLWKSGI LRSS PMF 
LPECPDSVGIEEDTTCI ICQQFLHLSAIVCSCRPSVFACLEHWKHLCECEPTKIjRIjVYRY 
TLAELHGMVQEVEKFGDCKTQETKNSQRPSSGTKRSGASNKKEGMQVCQARPAEDWIjRRA 
SKVLQDVFSSDVYATIjLKEAEQFLWGGSEMNRVRDVAKSIjIKAKIWAEAVGDCIjSKVEGK 
GNNDTEKVHLEF I DELLKVDPVPCFQSGYLKLKDFAEEARKIjSEKI DCSLSSS PTI AQME 
LLHSEVSSSPVSLSKHKILSKKISSAKMLAKRARCYLAASKPPGIELDELFKLKSEILEL 
QVTLTETEGILDLL»KKSEIARDKCSKV1jSGFISLKIT\^DLVHEFDGLCSINIREIjNILRQ 
YHVDALSWISRFDDTMADVREGKDQRKLISDLSSLLQDGASLGIQVSEIVEGLPLVEVEL 
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KKAS CREKAQTVYAARS SLDF I EQLLSEAWLQI EEEKLF VE I AGTLSTARFWEERAS S I 
LASETQMSDLKELVHPCLVFLKS FVHQKDDVRRMSVNI GAVLPSLKGI ENTI SLAETWLQ 
NSEPFLSAVSSTASSPCSLLELPVLKDLVAQSKSLGVQLEEPRILETLLLSCERWQCDNH 
PLLQETEDLLDTAKTDDGKHSTILPKIMDLITRVDCARTSGLSLGIiNLEELPKLQTASLK 
LGWCYKTI LLGSSS PS PEI PEDLGKPSLQKIQQHLEAVCSDNNINFLRLGQTIiKI LPEEY 
HLGKRLVELKI)TGQEWAKRARKVVTDSGAIiALED 

MLHCVCLKPYNSRFMVSCSQCGEWYHTYCVKLHWRPEAYVCFACCPPAESSPKNDPSRSM 
EPKTPSLDHRRARRWTGAAVGDLQWKSRKRIKRVAKRCLQVHIIjPWFFTREPK 
(SEQ ID NO: 17) 

CDS predicted_CDS_9 | 4485_bp 

a tgggaaaaggtagagctaaggcagtagagaaaagggttttagatcagaagctgagagga 
tcggtcaatgtcccgtctggaccggtttattacccgaccgaagacgaattcaaaagccct 
ctagattacatacacaagatcaagcccgaggcagaggcttacgggatctgcaagattgtt_ 
^ccaccgaaaagctggaaaccgcctt-t-cggtttggacttggagtctgtcag^gtttccgaca 
aagacgcaggagattcaccggttgcagttccgtcccgcttcttgcaactccaagacgttt 
cagctggagtacgggaggtttgtggaggagcgtttggggaagaaggtgaagaagagggtg 
gtttttgaagggggtgatttggatctgtgtaaggtgtttaacgcggtgaagaggtttggg 
ggttatgataaggttgtcaaggggaagaaatggggtgaggt ttatcagtt tatgagctct 
ggtgagaagatctccaagtgtgctaagcatgttttgtgtcagttgtataaagagcatttg 
catgagtttgagaagtatcacgggatgatgagtgcggatccttctgcaaagggtcacaaa 
aggaatcgacggtgttctgagttctctagctcgaagagaaggaaaagaaataatggcgag 
aaaaatcataaggtggaaagtgaagaagaagaggttgatcaggcatgtgagcagtgcaaa 
a 9tgacaaacatggtgaagtgatgctcttgtgtgatagttgtaataaaggttggcatata 
tactgtctctcgccgccgttgaagcatatcccgcctgggaactggtattgccttgattgc 
ttaaacactgatgaagacacttttggctttgtgccgggtaaatgtttgctacttgaagat 
ttcaagcggattgctgatcgcgccaaaaggaagtggtttggttcagggccggtgtctagg 
acgcagattgagaagaagttttgggaaatagtagaagggtcaggtggtgacgttgaagtt 
atgtatggtaatgacttggatacttctgtttacgggagtggttttcctagaataggtgat 
agaagaccagaatcagttgaggcaaatgtttgggatgaatattgccatagcccttggaat 
ctcaataacatgcctaagttgaaaggatctatgcttcaggccattcggcataacatcaat 
ggtgttacagtgccttggctatatcttggaatgctcttctctgctttttgttggcatttt 
gaggaccattgtttttactctgtgaattatcaccactggggagaagcaaaatgttggtat 
ggtgttccaggcagtgctgctagtgcttttgaaaaggtcatgcgaaaagccctacctgat 
ctctttgatgctcagccagatttgctctttcaactggttactatgttgagtccggctgtt 
ttgcaagaaaataaagtccctgtgtacacagtattacaggagcctggaaactttgtgatc 
acgtttccaaaatcctttcatgctggattcaatttcggtttgaattgtgcagaggccgtc 
aattttgccactgccgattggctaccttatggtggttatggtgcggagctgtataggctg 
taccgtaaacctgcagtcatatctcatgaagagcttctctgtgtggtagctaagggaaac 
agctgcgatggcaaaggatcaatatatttgaagaaagaactgctcagaatatatagcaag 
gaaaaaactttgagagagcagctttggaaaagtggtattttgagatcctctcctatgttt 
ctacctgaatgccctgattctgtgggcatcgaagaggatacaacatgcatcatctgccag 
cagtttctccatctttctgctatcgtctgcagctgcaggccatctgtttttgcatgcttg 
gagcactggaagcacctttgtgaatgtgaacctacaaaactgcgcctggtatatcggtat 
acccttgccgagttgcatggaatggtacaagaag^tgaaaagtttggtgactgcaaaact 
caagaaaccaaaaattcacaacggccgagttcaggcaccaaacggtcaggtgcttctaac 
aaaaaggaggggatgcaagtttgccaggcacgaccagctgaagattggcttcgtcgagca 
tcaaaggttctccaggatgtcttttccagtgatgtatatgccactcttttaaaggaagca 
gaacagtttctttggggtggatcagaaatgaaccgcgtacgggatgttgcaaaaagtttg 
atcaaagcaaagatatgggctgaagctgttggcgactgtctttcaaaagtcgaaggcaaa 
ggcaacaatgatacagagaaagttcacttggagttcatcgatgagttgctgaaagttgac 
cccgttccttgctttcagtctggttatcttaaattaaaggactttgctgaagaggctagg 
aagttgtctgagaaaatcgattgttctctgtcaagtagcccaacgatcgcccagatggag 
ctattgcattccgaagtttccagttcaccagtctccctaagtaaacacaagatcttgtca 
aagaaaatatcttccgcgaagatgttagctaaaagggcgagatgctatcttgccgcttca 
aaacctcctggaattgagttggatgaacttttcaagctaaagtcagagatattggagGtt 
caggtgacgcttacagaaacagaagggatcctggatttgctaaagaaatcagaactagcc 
cgtgataaatgtagcaaagttttgagtggttttatatctctcaagaatgtagaagatttg 
gttcatgaattcgatggcctttgtagcattaacattcgtgagctgaatatcctgaggcag 
taccatgttgatgctttgtcttggatttcacgctttgatgatacaatggctgatgttcgt 
gaaggcaaggaccaacgaaagctaatcagtgacctgagttcccttctccaggatggagca 
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tccttaggcattcaagtatctgaaatagttgaagggctccctcttgttgaggttgaattg 
aagaaggcatcttgtcgtgaaaaagcacaaacggtttatgctgcaagatcgtctctggat 
ttcattgagcagctgctctcggaagctgttgtactacaaatcgaagaggagaagctgttt 
gttgagatcgccggaac.tttgtctacagcgaggttttgggaggaaagagcaagcagtatt 
cttgccagtgaaactcagatgtccgaccttaaagaactcgtacacccatgcttagtattt 
ctcaagtcttttgttcatcaaaaagatgatgttcgtaggatgtctgtcaacattggtgcc 
gttttaccctctctgaaaggcatagagaacaccatttcgttggctgaaacttggcttcag 
aattctgagccctttttatctgccgtttcctctactgcctcatctccgtgttctctgctt 
gaacttcctgtgttaaaggacctggttgctcagtctaaatcgcttggtgttcaacttgaa 
gagccaaggattcttgaaacattactgcttagttgcgagaggtggcagtgtgataatcac 
ccgctcttgcaagaaactgaagatttgttggacactgcgaaaacagatgatggcaagcat 
agcacgattcttccgaagattatggacttgataaccagagtggactgcgccagaacatct 
ggtctgtcccttggtcttaatttggaagaacttcccaaacttcaaacggcaagtttaaaa 
ctaggatggtgttataagaccatcttgttaggctctagttcaccttcccctgagatacca 

gaagatctagga aagc cctcgttacagaaaattcagcagcacttagaagcggtatgttctr" 

gataataatatcaacttcctgcgcttaggacaaacactgaaaatattacctgaagagtat 
cacttaggcaagagacttgtggagctaaaagacactggacaagagtgggcaaaacgagct 
agaaaagtggtgacagactcgggtgctcttgccttggaagatgtattcgagcttatctct 
gagggtgaaaatttgcccgtcattgcagaggaagaacttcaggcattacgagctcgaagt 
atgttgcactgcgtttgcctgaagccatacaactcaagattcatggtttcttgtagtcaa 
tgtggcgaatggtatcacacctattgtgttaaacttcactggcggcctgaggcttatgtc 
tgcttcgcttgctgtcctccggcagaatcctctccaaaaaacgatccctccagatcaatg 
gagccaaagacaccgtcacttgaccatagacgggcaaggagggtagtgaccggtgcagca 
gttggtgatttgcagtggaaaagccgtaaacgcatcaaaagggtagctaaacggtgtctt 
caggttcatatccttccctggtttttcactcgagaaccaaaataa 
(SEQ ID NO:l8) 

gene join (52990. .53029,53942. .53981,55943. .55978,56068. .56294, 

56387. .56522,56608. .56741,56834. .56885,5703 8. .57123, 
5 723 8 . . 57426, 57519 . . 57 6 71 , 5784 9 . . 57 95 0 , 58 036 . . 58278 , 
58289 . . 58294) 
/gene = " Pept ide_l 0 " 

/note="GenScan; Pl^Prom, P2=0.308, P3=0.600, P4=0.976, 
P5=0.934, P6=0.988, P7=0.510, P8=0.940, P9=0.965, 
P10=0.876, Pll=0.964, P12=0.978, P13=PlyA" 
predicted_peptide__10 1 465_aa 

MFQYVSC I VQCGYEPQESNRGLTSSGGDDRIGGEGNDVPQFRE FS I ETLRNATSGFSTEN 

IVSEHGEKAPNVVYKGKLDNQRRIAVKRFNRKAWPDSRQFLEEAKAVGQLI^^ 

CCYEGEERLLVAEFMPNETLAKHLFHWESQPMKWAMRLRVALHIAQALEYCTGKGRALYH 

DLNAYRVLFDDDSNPRLSCFGLMKNSRDGRVTPESVMYSYGTLLLDLLSGKHIPPSHALD 

LIRDRNIQMLIDSCLEGQFSSDDGTELVRLASRCLQYEPRERPNPKSLVTAMIPLQKDLE 

TPSHQLMGIPSSASTTPLSPLGEACLRTDLTAIHEIVEKLGYKDDEGAATEMWTNQMQDS 

LNFKKKGDVAFRHKDFANAAECYSQF I EGGTMVS PTVYARRSLCHLMNEMPQEALNDAMQ 

AQVI S PAWHI AS YLQAVALTAIjGQENEAHAAIiKDGSMLES KRNAL 

JSEQ ID NO: 19) 

CDS predicted__CDS__10 | 13 98_bp 

atgtttcaatatgtttcttgcattgttcaatgtggctatgagcctcaagaatccaatcgt 
ggactcaccagctctggtggtgatgataggataggtggtgaagggaacgatgtgcctcag 
tttcgtgaattctctatagagacgctaaggaacgctacgtcagggttttctacagagaat 
atagtatcagagcatggtgagaaagctcccaatgttgtctacaaagggaagttggataat 
cagagacgtatcgctgtcaagaggtttaacaggaaagcttggcctgattctcgtcagttc 
ctggaggaagctaaagctgttggtcagttaaggaactataggatggctaatctgcttgga 
tgttgttatgaaggtgaagagagacttcttgttgctgagtttatgcctaatgaaactttg 
gctaagcatcttttccactgggagtcacaaccgatgaagtgggcaatgagactaagagta 
gctttacatattgctcaagctttggagtactgtacaggcaaagggcgtgcactctaccat 
gaccttaatgcttatagagttctctttgatgatgactcgaatccaaggctttcttgcttt 
ggtctgatgaaaaatagtagggatggtcgcgtgacaccagaaagtgtgatgtacagttat 
ggaactctgttgcttgatcttctcagtggaaaacacattcctccaagccatgcgctggac 
ctcataagggacaggaacattcaaatgttgatagactcatgtttggagggtcaattttca 
agtgatgacgggactgaactggtacggttagcttctagatgcttacagtatgagcctcga 
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gaacggcctaacccaaaatctctagtcactgcaatgatccctcttcagaaggatcttgag 

act ccttcacatcaactaatgggcataccaagcagtgcctcaacaacgcctcttt caeca 

ctcggagaagcatgcctaagaactgacctaactgccatacatgagattgttgaaaaactt 

ggttataaagatgatgagggagcagccacagagatgtggaccaaccagatgcaggactcg 

ctgaacttcaagaaaaagggtgatgttgctttcaggcataaagactttgcaaatgctgct 

gaatgttattctcagtttatagagggtgggacaatggtttcaccaactgtttatgcaagg 

agaagtctgtgtcacctgatgaatgagatgcctcaagaggcgttgaatgatgcaatgcaa 

gcccaagtgatatctcccgcttggcatatcgcatcttatcttcaagctgtagctctcaca 

gctctaggacaagagaacgaagcacacgctgctcttaaagacggatcaatgctcgaaagc 
aaaagaaacgctctatga 

(SEQ ID NO:20) 

gene join(58757. .58796,59699. .60947,6134 0. .61469,67106. .67244, 

67265. .67376, 693 64_._. 69578 , 69988 . .69993) 
/gene="Peptide_n~" 

/note=^3enScan; Pl = Prom, P2 = 0.JL56, P3 = 0.008, P4 = 0.354, 
P5=0.298, P6=0.137, P7 = PlyA" 
predicted_peptide_ll j 614_aa 

MASQCSENVIPTLRDFFNSQKPGEEDEFMVHQVLNSSSKNIPPRPTKLGNKAKERHGLFN 
QGRIRSRRNISDAEKFSVEQYSSSGFFGVRFNTNGRQQQQQRSAKPLGSDR20MEPRLQKS 
FSARMQLPFMLSSKPSNQSTNNSSWFSRIKKMSNPFSNRNSLIPKSGEIKVSGVGETLSR 
NKS S S P VHLHAHL S I QH ELGMP VFT F S IiDH PDDVYTART WMDVND S R F VYS FRY I GGRS N 

KNIrGEQRSNVSGIDSSIilGQMQVSTQVSLEVEEEPYEDPVESAVSEFVLFDIARARRSGL 

KTEQLSRQNSVSDGLKHLQRQNSFSRGLTRSFSKHSENSASSSSDPWPATDLHPGLEIAA 

VVIKDSYSSSNNESFEYMKNSKLSRREMKVIVPSGNHGLPDAENSCPTPILQRWRSGSQL 

YKHSQYVWLSCTEQKYQEEKEATLCLGAVRLESLLTWQLRQPDECSSFSKPIQHNFAKLM 

RFIWNVPGGDGDNNLNGTCWLPSLPYMTQNVQNAKAPSYSISSTKTPPPLQLLLPPPLNA 

IRHDQLAQPLLISPLIVINAFNGQECKITDRFKGFKEEETVKFYIYWRCLNRFLIRVATC 
CATHTVPATVQQNP 

(SEQ ID NO:21) 

CDS predict ed_CDS_ll | 1845_bp 

atggcatcgcagtgttctgaaaacgttattcctactctacgtgatttcttcaactcccaa 
aagccaggggaagaggatgagtttatggttcatcaggtcctcaactcctcttccaagaac 
attccaccgagacctacaaaactaggaaacaaggcaaaagagagacacggtctcttcaat 
cagggaagaataagaagcagaagaaacatatcagatgctgaaaagttctctgtggagcaa 
tactcttctagtggtttctttggggtacggttcaacacaaacggaagacaacaacagcaa 
caacgatcagctaagcctttagggtcagatagaaacatggaaccgagattgcagaagtca 
ttctccgctagaatgcaactcccctttatgctatcttcaaagccaagcaaccagtctacg 
aataattcaagctggtttagccgtatcaagaaaatgtctaatccattttcaaatcgaaac 
tctctgataccaaagtcaggagaaatcaaggtcagtggagtaggagaaacactctcaaga 
aacaagtcttcttcacctgttcatctacatgcccatctcagtatccaacatgaacttggg 
atgcctgttttcaccttctctctagaccacccggatgatgtgtatacggccaggacgtgg 
atggatgttaacgactctcggttcgtctattcgtttcgttacattggtggtaoaagcaac 

aa 9 aacctc ggagaacagaggtcgaacgtttcaggtatagactcttcacttataggacag 

atgcaagtttcaactcaagtctccttagaggtagaagaagaaccatacgaagatcctgtt 

gaatcagccgtgtcagagttcgttctttttgacatcgcacgagcacggagaagtggactc 

aagactgaacaactgtcaagacagaacagtgtatctgatggcttaaagcatctgcaaagg 

cagaacagtttcagcagagggttgactcgtagtttctcaaaacattcagagaacagtgca 

tcatcatcatctgatccttggccagccacggatttacatccaggtctagagattgcagca 

gtcgttattaaagactcttattcttctagcaataatgagagttttgagtacatgaagaac 

agtaaactctctagacgagagatgaaggttatagttccatcaggaaaccacggtttgcct 

gatgctgaaaactcatgtcctacaccgatactgcagagatggagatcaggctctcagc.tt 

tacaagcattctcagtatgtgtggctgagctgcacagaacagaagtatcaagaggagaaa 

gaagcgactctttgtctaggtgcagttcgcttagagagcttattgacatggcaactccgc 

caacctgatgagtgtagttcgttctccaagccgatccaacacaacttcgccaagctcatg 

cgtttcatctggaacgtaccaggaggagatggagacaacaatctcaatggtacttgttgg 

ttgccctcgttgccttatatgacccaaaatgtccaaaatgcaaaagcaccgtcctactcg 

atttcctcaaccaagacgcctcccccgctacaactgctcctgccgccaccactgaacgca 

ataagacatgaccaacttgcacagcctctcctaatcagcccattaattgttataaacgct 

tttaatggtcaggaatgtaaaatcacagaccggtttaaagggtttaaggaagaagagacc 
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gtgaaattttacatttactggagatgcttaaatcggttcttgatcagagtggctacgtgc 
tgcgcaacccacacggtgccagccacggtgcagcagaatccatga 
(SEQ ID NO:22) 

gene join (73 822 . . 7 3 861 , 75318 . . 75606, 75677 . . 75984 , 76063 . . 76139, 

7 626 0. .764 85, 76586. .76721,76 935. .77101,77273 . .774 08, 
78119. .7824 0, 79010. .79015) 
/gene= 11 Pept ide_12 » 

/note="GenScan; Pl=Prom, P2=0.951, P3=0.989, P4=0.507, 
P5=0.492, P6=0.997, P7=0.864, P8=0.418, P9=0.265, 
P10=PlyA" 

predicted_jpeptide_12 | 486_aa 

MEIEELELTPLRQPSPEMKQSVA/ljKPTLSKKKKKLSVACISSNKRIjLVWLGGVIiAVSSIV 
VITLFKTLPHHQSIPPPQDNSTIAIjPMALKFFNAQISGKIjPEGNNVSWRGNSCLNDGNFP 
GSLYPHLAGGYYDAGGSIKTSFTMSFSMTMLSWSVIEYGSKYEACGEVDHVKGIjIKWGTD 
— YFLRTFSSSSDTIYE^WYQIGMNQGSQVTSDL.YCWMRPEDIDYQSAKAVYLYAEAMSAKR 
KSADHWDDLIWGGAWLYYATGDNSYIiAKOTSHDLANRAGAFSHGPRYGVFGWDNKLAGTQ 
liLFTRLRLFLSPPFPYEEMLRVFHEQTSIVMCSYLPYYTKFNRTKGGLILLSEPEPLQYA 
ANAAFLATLYSDYQGASDAPGWYCGPTFFKTEILRDFSTSQGSLDPEEQESDMRRRFEVE 
GEHERESKHDRRGNGRWTRQEGWLPRRLLKESLLNNIVSTQICSFTEGDKFIISSQYCNT 
QWGKT 

(SEQ ID NO: 23) 

CDS predicted_CDS_12 | 1461_bp 

atggagatcgaggagctcgagcttacacctctgagacagccttctccggaaatgaagcag 
agttgggttctcaaaccaacactctccaagaaaaagaagaaactttctgttgcttgcatc 
tcaagcaacaaaaggt tacttgtatggcttggaggagt cttggctgtgtcgtctatcgtc. 
gtgataacactgttcaagacactcccccaccaccagagcattccaccaccgcaagacaat 
agcaccatcgcacttcctatggcattgaaatttttcaatgcccaaatctccggaaaactg 
ccagaggggaataacgtgtcttggaggggaaattcttgcttgaacgatgggaattttccc 
gggagtctttacccacatctggcgggagggtactacgatgctggaggttcgatcaagaca 
agcttcaccatgtctttctcaatgacaatgttgagctggagtgtcattgaatatgggtcg 
aaatatgaggcttgtggagaagtggaccacgtcaaagggctcattaaatggggaaccgac 
tacttcctccgcacttttagcagcagttctgatacgatctatgaaatggtgtatcagata 
gggatgaatcaaggaagccaagtgactagtgacctatactgctggatgcgaccagaagac 
attgattaccaaagtgccaaagccgtttacctttacgccgaggctatgagcgccaagcgg 
aaaagcgcagatcactgggatgacctcatatgggggggagcgtggctctactatgccacg 
ggtgataactcgtatcttgctaaggtaaccagtcacgatctagccaaccgtgccggtgcc 
ttttcccatggccctcgttatggtgtctttggttgggacaacaagcttgctgggacacag 
ttgcttttcactcggttgaggctgttcttgagccctcccttcccgtatgaagagatgctc 
agggtctttcatgagcaaaccagcatagtaatgtgctcctacttgccgtattacactaag 
tttaacagaacaaaaggtggtttgatcctgctgagcgaaccagagcctctccagtatgct 
gcgaatgcagctttcctggccactctgtacagtgactaccaaggcgcttctgacgctcct 
ggatggtactgtgggcGaactttct t caaaactgagatcctacgtgacttttcgacatct 
caagggagcctcgatcccgaagaacaagaaagtgacatgcgaaggaggtttgaagtggaa 
ggagagcacgagcgagaatccaaacacgatcgaaggggcaatggtcgctggaccagacaa 
gaaggatggcttccacgacgtttactcaaggaatcacttctcaacaatattgtttccact 
caaatctgttccttcactgaaggagacaaatttataatctcttctcaatattgtaacaca 
caagtggtaggaaagacttga 
(SEQ ID NO: 24) 

gene join(79734. .79773,86164. .86340,87980. .87985) 

/gene= " Pept ide_l 3 " 

/note="GenScan; Pl=Prom, P2=0,178, P3=PlyA" 
predicted peptide 13 |58 aa 

MIYRFWPKARFSGFGGKSRFPVIjAGNHDFPVLAEKRDFSVFAEKKFSVIiAENMISIiFW 
(SEQ ID NO:25) 

CDS predicted_CDS_13 1 177_bp 

atgatttatcggttttggccgaaagcacgattttccggttttggcgggaaatcacgattt 
ccggttttggcgggaaatcacgattttccggttttagcggaaaaacgtgatttctcggtt 
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ttcgcggaaaaaaaattctcggttttagcggaaaacatgatttctctgttttggtga 
(SEQ ID NO: 26) 

gene join (88 073. .88112,91231. .91439,91774. .91973,92397. .92435, 

93774 . . 93832, 93930 . . 94 033, 94 874 . . 94 993 , 96369 . . 96467, 
97170 . . 9754 5, 97582 . . 97587) 
/gene=" Peptide_14 " 

/note="GenScan; PI = Prom, P2=0.466, P3=0.372, P4=0.407, 
P5=0.188, P6-0.901, P7=0.218, P8=0.351, P9=0.547, 
P10=PlyA" 

predicted__peptide_14 |401_aa 

MEWPALASELDEI AALSKEFMSLS I CAI PRTQNARADGLiAKGGRTRKLI PFVSDSAPTWL 
APELASLTAVELSPHIiAIDLFSSSYRAFVIIDPVLGPVWIKVKPTSNNNHPVLGHQPGQVF 
IPIW^CLTPKVYEKPIYIVMFLPSASSFEKKLHRLKKLMQMCRNNILKRAPNPPSTTLI 
SYLIKiEITNTRINFTDNAFKFQRERLIAPMSHLLRRKTVMAPEKSSQPPDMETKLDLEP 

ESQSKRERTTEINDSIERHRRLQIFSTKKTITIMVGWFKELCLENGNPEAHYIVQYFVHK 
EKQTGLFHLRQSATRNNGNNMHLYVLLM 

KNSLSAI PVPMEQRYYVNiyrraLKPHTNCDPlsTNMAKVCKQCY 
(SEQ ID NO: 27) 

CDS predicted_CDS_14 | 1206_bp 

atggaatggccagcattggcctctgagttggatgaaattgcagcactgtctaaagagttt 

atgagcctctctatatgcgcaatcccgagaactcagaatgcccgagcggacggcttggca 

aaagggggacgtacacgcaagctcatcccgttcgttagtgactccgcacctacatggcta 

gcccctgaagctagtctaacggctgtagaactatcccctcatcttgcgatagacctattt 

tcttcttcttatcgggcctttgtaataattgacccggttcttggaccggtatggattaag 

gtgaaacctacctccaataataatcatccagtcctcggtcatcagccaggtcaggtgttt 

attccgatttggatggtatgtttgacgccgaaggtttatgaaaaacccatttatatagtg 

atgtttttgccttcggcatcgtcttttgagaagaaattgcacagattgaagaagttaatg 

cagatgtgcagaaacaacatccttaagagagctccaaatcctccatctacgacccttatc 

agttacttgatcaaacttgagatcacaaatacacgcatcaactttaccgacaacgctttc 

aagttccaaagagagagattgattgctcccatgtctcatctgttgagaagaaaaactgtc 

atggctcccgagaaatcctctcagccgccggacatggagacaaagttggatctagagcca 

gagagtcaaagcaaacgtgaaaggacaaccgagatcaatgatagtatcgagagacaccgg 

agactgcagattttttcaactaaaaaaacaataacaatcatggtcgggtggttcaaagaa 

ttgtgtttagagaacggcaatcctgaggcgcactatattgttcaatacttcgtccacaaa 

gaaaaacagacaggcctctttcatctacgccaatcagccactaggaacaatggaaataat 

atgcatctttacgttttgttaatgcttgctgaaggtcactatcaaaccggtaaaaaatat 

ttggataaactccaatggaaaaagaaacgatcaacctccgatcattgctgtgaaagaatc 

aagaactcactaagtgcaatacctgttcctatggagcagcggtactatgtaaatatggtc 

aaccttaagccacatacaaattgcgatcccaataacatggctaaagtatgcaagcaatgt 
tactaa 

(SEQ ID NO: 28) 



gene join (97759. . 97 798 , 99055 . . 100012 , 100273 . .101111, 

101293 . . 101298) 
/gene= " Pep t ide__l 5 " 

/note=»GenScan; Pl^Prom, P2=0.952, P3=0.971, P4=PlyA" 
predicted_peptide_15 | 598_aa 

MRIGVSEPEQCGCDTCVQHRTLCTQETEPSKEVTGSSVPVSSEPVQRLGSTSDQCSGTHT 
TPLAPPEPAAQSVDASSTSSSIFSSVSSQPARALCPTGSLPVPLFGCSWPRPCSCTGCSL 
LGPSIRRSSPFFTASSGSSISSSRQANVTNSFGSAASEPSVSGPMKAPIFTSGSSTASTS 
STLPSLVTPSDITRGSVQAPVQANTSKTASDFHPPNVANTGVCAASRTSTNNPFPGFSyD 
YLPRCPSNLSRPNAPTTTPVPGPSSVLAGGETEQGSRYPRYAPTPDVDGKQIISISASNS 
HGHKSHEELRWEDYKNGDKAGVGSFPPPDHTPSVFTPPSIPDRPRMRTIDLTNRDTSGFP 
IGYNTPAAFQSPHEPVGVSSPASGCTACGAASSSSPSSHLGLNSTTNPPSSATSLPGLFF 
STYGSFPLLFATPNLAAYGTTPAVQAYPMMFGI PNLAAQGTATPSVQAYPMI FGI PNLAA 
QGTTATPAFQAYPMI FGI PNVAAQGTTTTTPAAQA YPMMFG I PNLiAAQGTTTPAAQ P YPT 

MFGTPSLAAQGTTTAPAVQPYPTMYGTPNFVAQGMTPAAQAYPVNGSSLLPFAAMSLQ 
(SEQ ID NO: 29) 
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CDS predicted_CDS_15 | 1797_bp 

atgaggattggtgtttctgagccagagcaatgcggttgcgacacgtgtgtccagcatcgc 
actttatgcactcaagaaaccgaacctagcaaagaagtgactggctcatcggttcctgtt 
agttcagaaccagttcaacgtctcggttccacctcagatcagtgttccggaacacatacg 
act ccactcgctcctcctgaaccagcagcgcagt ctgttgatgcatcctccaca teat cc 
tccatattcagttctgttagttcccaaccagcgcgagctctttgtcccaccggttcactg 
ccagttcctttatttggttgttcatggccacgtccttgtagctgtactggttgttcactg 
ctaggtccttctattcgtcgttcatctcccttctttactgcttcttccggatcatcaatc 
tcctcgtcaaggcaagccaacgttacaaacagtttcggttccgctgcatccgaaccatct 
gtatctgggccaatgaaagctcctatttttacttctggctcttcaaccgcttccacatcc 
tcaactctaccttcattagttactccctcggacattacaagaggatcagtgcaagcgcct 
gtccaagGtaacacttccaagactgcttctgattttcatccacctaacgttgccaacact 
ggagtttgcgctgcttcaaggactagcacgaacaatccatttccaggatttagtgttgat 
tacttgcccagatgtccctctaacctttctcgaccaaacgcaccaactactacaccagtt 
cctggccctagttcagttttggctggtggtgaaajrtgaacaaggtagtaggtatcctcgt- 
tatgcgcctacaccagatgttgacggcaagcagattatttccatatctgcttccaactca 
catggacataaaagtcatgaagagttgaggtgggaagattacaaaaatggagacaaagct 
ggggttgggtcgtttcctcctcctgatcatacaccgtcggtgtttactcctccaagcata 
cctgatcgtcctcggatgagaactattgatctaacgaaccgagacacgagtggttttcct 
attggctacaacacccccgctgctttccagagcccccatgaacccgttggtgtttcttcc 
ccagcatcaggatgcacagcgtgtggagccgcgagtagctcctctccttcgagtcacttg 
ggct tgaacagtaccacaaatcctccatcatctgcgacatctcttcccgggctgt tcttt 
tccacctatggttcttttcctttgctgtttgcaacaccaaatcttgcagcttatggtaca 
actccagcagtccaagcctatcctatgatgtttggaataccaaatcttgetgctcaaggt 
acagcaactccatcagtt caagcgtatcctatgatatttggaataccaaatcttgctgct 
caaggtacaacagcaactccagcttttcaagcgtatcctatgatatttggaataccaaat 
gttgctgctcaaggtacaacaacaacaactccagcggct caagcgtatcctatgatgttt 
ggaataccaaatcttgctgctcaaggtacaacaactccagcggctcagccctatcctacg 
atgtttggaacaccaagtcttgctgctcaaggtacaacaacagctccagcagttcagccc 
tatcctacgatgtatggtacaccaaattttgtagctcaaggtatgactccagcagctcag 
gcctatcccgttaatggttcaagtct tctcccatttgccgccatgagtctgcagtaa 

(SEQ ID NO: 30) 

gene join (102 918 102957 , 103375 . .1044 97 , 104589 . . 105589, 

106421. . 106426) 
/gene= 11 Pept ide_l 6 " 

/note="GenScan; Pl=Prom, P2-0.985, P3=0.925, P4=PlyA" 
predicted_peptide_16 | 707_aa 

MRIDVSEPELCGCDTCVQHRTFITQETEPSKEVIGSSVPVSSEPVQPLGSTSDESSGTET 
TPLAPPPVTTPVNNPEPAAQSVGSTIPPAVTPVSSEQPAQALGSTSDQSSGTETTPLAPP 
ITTSVKSVDSTIFFKFPPVQAQAIiAPTASGSTQAPAFGFGAFAARVPSATSGCSAFSFAP 
PVTSAPVQALGTTTTTTTTTSAAAPASPFHSSSPTTFQFPPAFTSLAASTFPSVASSTSS 
PLDAPPSPFRWGSLQANTSPPFSFLPAQGSDKTGSAFTPPFGYPGGFARPDVGVSHPGFG 
PSNHFGPNAPTTTPVPVRS PFLAGGGTEQGSRYPRYSPTPDVDGRLIMS I SASNSHGHKS 
HEEL.RWEDYKNGDKGGFGWFPPVHTSPFSSPTVSPSLFAPPSIPNRPQMRTIDIjTNRDMC 
GFPIGYNTPAAFQRPPEPAGVSSPASGCTACGATSRSSPSSHLGLNNTTNPPSAATSLPG 
MFFSTYGSCPLLFGSPNLATYGTTAIPAVQAYAIMFGAPNFTSQGTTATPAFQAFPIMFG 
TPNIj7UVQGTTRAPAVQAYPTMFGTPNIGV(^STPAAQTYPLMFGTPNLAAQGTTNIGARG 
TTPAAQAYPLMFGTPNLAAQGTTTPAVQSYPTMFGTPNLAGQSTTTTRAGQPYPTTFAVP 
QAATAPAVQPYAMMFGTPSLGAQDITPGGQAYPAHGLTLPFAAMSLQ 
(SEQ ID NO: 31) 

CDS predicted_CDS_16 | 2124_bp 

atgaggattgatgtttctgagccagagctatgcggttgcgacacttgtgtccagcatcgg 
actttcattactcaagaaaccgagccgagcaaagaagtgattggctcatcggttcctgtt 
agttccgaaccagttcaacctcttggttccacctcagatgagagttcaggaacagagacg 
actccactcgctcctcctccagtcaccacaccggttaataatcctgaaccagcagcgcag 
t ctgt t ggct caa.ccatccca.ee tgctgt tacaccagt tag t tccgaacaaccagcacaa 
gctcttggttccacctcggatcaaagttccggtacagagaccactccact cgctcctcct 
atcaccacgtcggttaagtctgttgactcgaccatcttcttcaagttcccaccggtacaa 
gcacaagctcttgcccctactgcttccggttcaacgcaagcccctgcttttggttttggt 
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gcattcgctgctcgcgtaccatctgccacctccggttgttcagcatttagtttcgcccct 

cctgttacatcggcaccagtgcaagctctaggcacaaccactactactactactactaca 

tccgcggccgctcctgcatctccatttcacagttcctcaccaaccacattccaattccct 

cctgcttttacatcccttgctgcttctacttttccttctgttgcatcatcaacttcatct 

ccacttgatgctcctccctcaccatttagatggggatcactgcaagctaacacttcccca 

ccctttagcttcttgccagcgcaaggttctgacaagactggttctgcttttactccaccg 

tttggctaccctggtggttttgccagacctgatgttggtgtctctcatccagggtttggt 

ccctctaaccattttggaccaaacgcaccaactactacacctgttcctgttcgcagtcca 

tttttggctggtggtggaactgaacaaggtagtaggtatcctcgttattcacctacacca 

gatgttgacggcaggctgataatgtccatatctgcttccaactcacatggacataaaagt 

catgaagaattgaggtgggaagattacaaaaatggagacaaaggtgggtttgggtggttt 

cctcctgttcatacatctcccttttcctcaccaacggtatcaccgtcgctatttgctcct 

ccaagcatacctaatcgtcctcagatgagaactattgatctaacgaaccgagacatgtgt 

ggttttcctattggctacaacacccccgctgctttccagagaccccctgaacccgctggt 

gtttcttccccagcatctggatgcacagcgtgtggagccacgagtaggtcctctccttct 

a 9tcacttgggcttgaacaataccacaaatcctccatcagctgcgacatctcttcccggg 

atgttcttttctacctatggttcttgtcctttgctgtttggctcaccaaatcttgcaact 

tatggtacaacagcaattccagcagtccaagcctatgctattatgtttggggctccaaat 

tttacttctcaaggtacaacggcaactccagcttttcaagcctttcctattatgtttggg 

actccaaatcttgctgctcaaggtactacaagagctccagctgttcaagcctatcctacg 

atgtttggcacgccaaatattggagttcaaggttcaactccagcagctcaaacctatcct 

ttgatgtttggcaccccaaatcttgctgctcaaggtacaacaaatattggagctcgaggt 

acaactccagcagctcaagcctatccgttgatgtttggcaccccaaatcttgctgctcaa 

ggtacaacaactccagcagttcagtcctatcctacgatgtttggaacaccaaatctagct 

ggtcaaagtacaacaacaactcgagcaggtcagccatatcctacgacgtttgctgttcct 

caagctgcgacagctccagcagttcagccgtatgctatgatgtttggtacaccaagtctc 

ggagctcaagatatcactccaggaggtcaagcctatcccgctcatggtttaactctccca 
ttcgccgccatgagtctgcagtaa 

(SEQ ID NO:32) 

9 ene join(106892 . .106931,106945. .107669,108022. .108118, 

108693 108698) 
/gene= " Pept ide_17 " 

/note= n GenScan; PI = Prom, P2=0.909, P3=0.967, P4=PlyA" 
peptide_17 | 273_aa 

MARIITTLSTPLFFFFLFSLLSHQTISQPEHLTTFCNPSDNFTQTSSYEANRDLLLSSLR 
VS S S LGT YS NATVGR S PNTVHGMFL CRGDTTAAS CS DCVQTAT I E I ATNCTLNKEAVT Y Y 

EECJWRYSNVSFFSVLEVRPSIVLYSLRSAPNSbTTLNETIiADKFNQLlLNVSSSSLVPYF 
LEDQELVTQAEGS YKFESMVQCS PGLDRFNCTVCIiRFALLRVSTCCGS PSS ALI FTPKCL 
LRKPNI WENC YDDGS FG VCTCEFVNTFDDFGLK 
(SEQ ID NO: 33) 

CDS CDS_17 | 822_bp 

atggcaagaatcataactacactctctacacctctgtttttcttcttcctcttctccctg 

ctctctcaccaaaccatttctcaacccgaacacttgactactttctgcaacccttccgac 

aacttcacacaaaccagttcatacgaagcaaaccgagaccttctactctcctctctccgc 

gtcagttcctccctcggaacctattcaaacgccacagtcggtcgtagtcccaacacagtc 

cacggcatgttcctctgcagaggagacaccaccgcagcgtcttgctcagactgcgtccag 

accgctacaatcgagatcgctacaaactgtactcttaacaaagaagcggtcatatactac 

gaagagtgcatggttcggtactctaatgtttccttcttctctgttcttgaggtcagaccg 

agcatcgtcctttactctcttcgctctgctccaaactcgaatacgctcaatgaaacgtta 

gctgataaattcaaccaactgattctcaacgtgtcttcgtcctctttggttccgtatttt 

ctggaagatcaagaacttgtgactcaagcagagggttcttataagtttgagtcaatggtt 

cagtgtagtcctggtcttgaccggttcaactgtaccgtttgtctcagatttgcgctctta 

agagtttcaacttgttgcggttcaccaagttctgctctgatctttactcctaaatgtctt 

ttgaggaaaccaaatatttgggagaattgctatgacgatggcagttttggtgtttgcact 

tgtgaatttgtgaacacatttgatgattttggattgaaataa 
(SEQ ID NO: 34) 

9 ene join(108921. .108960,109245. .109298,109434. .109506, 

111201. .111292,111576. .111581) 
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/gene = " Pep t i de_l 8 *' 

/note = "GenScan; Pl=Prom, P2=0.821, P3=r0.759, P4=0.101, 
P5=PlyA" 

peptide_18 J 72_aa _ 

MEIHKSRFGSTSYKRPNQISKKYNLGVRVKFEIPFEFSDLILCYIPYPESFDIQLIALYY 
HFGFCDWLTRFC 
(SEQ ID NO:35) 

CDS CDS_1 8 | 2 1 9_bp 

atggagatccacaagtctcggtttggatctacgtcctacaagagacccaatcagatttcc 
aaaaaatataatttgggtgttcgggtaaaatttgagatacctttcgagttttcggatctg 
attttgtgttacataccttatccggaatcatttgatatacaacttatcgctttgtactac 
cattttggattttgtgactggctgacccgtttctgttag 
(SEQ ID NO:36) ____ 

gene complement (join {111677 .—11168 2 , 112729. .113089, 

11412 8 . . 114 711 , 114 812 . . 114 975, 115273 .. 1153 94 , 
11544 5 . .115631, 115752 . . 115 914, 1160 03 . .116109, 
116203. .1163 82,11646 9. .116564,11665 9. .116740, 
116823 . . 117164, 117401 . . 11743 0, 117506. .1175 98, 
117691 . . 117768, 117908 . . 117991, 118068 .. 118316 , 
118664. .118762,118 968. .119044,119232. .119322, 
122268 . . 122307) ) 
/gene= ,! Pept ide__l 9 " 

/note="GenScan; Pl=Prom, P2=0.472, P3=0.544, P4=0.999, 
P5=0.979, P6=0.999, P7=0.973, P8=0.890, P9=0 . 924 , 
P10=0.879, Pll=0.978, P12=0.991, P13=0.943, P14=0.77l, 
P15=0.475, P16=0.863, P17=0.628, P18=0.998, P19=0.996, 
P20=0.384, P21=PlyA" 
peptide_19 | 1062_aa 

MSGVYAPSDVTSFLSFDGSESRSSLDGSKKGHQNLVEWLNQTLPYLNLPSEASEDEGGSF 
EPAYVRVERFLTAMDEKAIiPRFEVSDIEQKVKMQGDMLPVFQSLKALKASFSDGGND 
LGARRRWSLPEDHSDSRGDDRNFIDGFQSKEGFEIDTSDAKISELLKSDSLRNAPTRTLF 
DMLDKLLDES VKKMNGHV S HAMAS LLSALVQ VI EQR I SNQADNIjKNQNI LFRVREDKYRS 
RIKVLETLAAGATQENEIVSNCMERTKLEKSRIEEREKSEEKDVVRLKKEKERSDAEIRK 
LKQELKVVKEAHTU^QCLELEAKAQNSTVELESKLKDAELQVAESTRKVKELEKLYIjSKSQ 
KWENRESTYQS F I DNQ FGALQALNATS VS I KQEVXjRTQKKYFEDLNYYGLKLKGVADAAK 
NYH\A^LEENRRLYNEVQELKGNIRVYCRIRPFLPGQNSGQTSIEYIGENGEI,WANPFKQ 
GKDTHRLFKFNKVFGQAATQEEVFLDTRPLIRSILDGYNVCIFAYGQTGSGKTYTMSGPS 
I TS KEHWGVNYRALNDL F HLTQLRQNT WYE VG VQMVE I YNEQ VRD I LSDEN FLNLRTLG 
WNTAIiPNGLAVPDASMHS VRSTEDVLELMNIGLMNRTVGATALNERSSRSHWI I EMTLL 
PIVYSVLSVHVRGVDVETDSVIjRGSLHLVDLAGRGQAKTLMFVQVNPDGDSYAETVSTLK 
FAERVSGVELGAAKSNKEGRDVRHLMEQVSSLKDVIAKKDEELQNVQKQKSNSTTVPKRG 
LSNLRLLGPSSPRRHSIGPSPNARRGKAPGSFGRAASDFDNCSEYSSKHSDSGSPRSSDE 
LKHRKDLHQLSKFAGGSKEIDIEDDIELIGLGDADSEDRLSDISDSCLSMGTETDGSICS 
AVELTLFPETVKPLEITEEPEPHLVPEKLEKSAKMVKTVPKDKTRTWSYQLFASFYGPDV 
PKTFI SHLRRQLNS KG I LMFDDQGTERGQAI RGS VI S I WLSKNYAS S S WCLKNLLE I L»K 
CKEQIVTTVFYGVDPSDVRKQTGDFGKAFKETCRGSKDGDKL 
(SEQ ID NO:37) 

CDS CDS_19|318 9_bp 

atgagtggcgtctatgctccctccgatgtaactagttttctcagcttcgacgggtctgaa 
agtcgttcaagcttggatggtagtaagaaaggtcatcagaatttggtggaatggttaaat 
cagacacttccttatttgaatttaccatcggaagcttcggaggacgaaggaggcagcttt 
gagcctgcttatgttagagttgagcggtttctgactgctatggatgaaatggccctgccc 
agattcgaggtttcagacatagaacagaaagtaaaaatgcagggggatatgctgccagtt 
tttcagtcccttaaggcccttaaagcaagtttttccgatggtggtaatgataaaaactca 
ctaggtgcgaggaggagatggagcttgccagaagaccattcggattccagaggagatgac 
cgcaactttattgatggattccagtcgaaggaaggatttgagattgatacatcagatgct 
aaaatttcagaattactgaaatctgacagtttacgaaatgctcctactcggacactattt 
gacatgctggataaacttctagatgagagcgtgaagaagatgaatggacatgtgtctcat 
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gcaatggcatcactcttgagcgcacttgtgcaagtgatagaacagagaatctcaaatcaa 

gctgataacctgaaaaaccaaaatatactcttcagggtacgtgaagataaatacaggtca 

agaataaaggtcttagaaaccttggcagctggggcaactcaggaaaacgagattgtttcg 

aattgtatggagcgtacaaagcttgaaaaaagcagaatagaagaaagagaaaagtcagaa 

gaaaaagatgtggtgcgtttgaaaaaggaaaaagagcgcagtgatgctgagattcgtaag 

ctgaagcaagaactcaaggtggtgaaagaggcgcatgcaaaccagtgcttggagttagaa 

gcaaaagcacaaaacagtacagttgagttggagagtaaattaaaggatgcagagttacaa 

gttgccgaatcaactaggaaggtcaaagaactcgagaagttgtacctatctaaatctcaa 

aaatgggagaacagagagtccacctaccaaagcttcatagacaaccagtttggtgctttg 

caggctttgaatgctacttcagtgtctataaagcaagaagtcttaaggacacaaaagaaa 

tactttgaggacctaaattactatggtttaaagctcaaaggagtggctgatgcagcaaaa 

aattaccatgtggtccttgaagaaaaccgaagactgtacaatgaagtgcaggaattgaaa 

ggaaatatcagagtctattgccggataagaccattccttccggggcaaaacagtggacag 

acttctatagagtacattggtgagaacggtgaattggtggttgcaaatccgtttaagcaa 

gggaaagatacccatcggttgtttaagttcaa_taaagttttcggtcaagcagcaactcaa"" 

gaggaggttttcctagatactrcgaccattaattcgatcaattcttgatggttataatgtg 

tgtatatttgcgtatggtcagacgggatctggaaaaacttatacaatgagtgggccaagc 

atcacttcaaaagaacactggggtgtcaattacagagctctgaatgacttgtttcactta 

actcagcttagacaaaacactgttgtgtatgaagtaggtgttcaaatggttgagatatac 

aatgagcaagttcgtgacatactttctgatgaaaatttccttaacttgcgtacattaggg 

gtttggaatactgccttaccaaatgggttagctgtcccagatgcaagcatgcattctgtg 

agatcaactgaagatgtgcttgagctgatgaatattgggctcatgaacagaaccgttggt 

gccacagctctcaatgaaaggagtagtagatcacactggattatagaaatgacattgttg 

cctattgtatacagcgttctttctgttcatgtacgtggtgtcgacgtggaaaccgattct 

gttttgcgtggtagtttgcacttggtcgatcttgctggaagaggacaggctaagactctt 

atgtttgttcaagtcaatcctgatggagattcttatgctgagacggttagcactctgaag 

ttcgctgaaagagtttctggtgtggaattaggtgcagctaaaagtaataaagagggacga 

gatgttagacacctcatggaacaggtatcaagcttgaaggatgttattgccaagaaagat 

gaagagcttcaaaatgttcagaagcaaaaaagtaacagtacaaccgtgccgaaacgtggt 

ttaagcaatctaagattgttggggccttcatcacctagaagacactctataggaccttca 

ccaaatgctcgacgaggaaaggcacctggttcttttgggagagcagcctcagattttgac 

aactgctcagaatacagtagcaagcattctgattccggttcaccgcgttcatcggacgaa 

cttaaacatcgaaaggatcttcaccagctatctaagtttgccggtgggtcaaaagaaatt 

gacattgaagatgacattgaactcattggccttggggatgcagattctgaggacagattg 

agtgatatctctgatagctgtctttcgatgggaacagaaactgatggctccatatgcagt 

gcagtcgagttgactcttttccctgaaaccgtgaagcctcttgaaataacggaagaacct 

gaaccacacttggtccctgagaagctcgagaaatcagcaaagatggtgaaaaccgtgccc 

aaagacaagactcgcacttggagctaccaactctttgcgagtttctacggacccgacgtc 

cctaagacttttatcagtcatttgcggagacagttaaacagcaaggggattttaatgttc 

gatgatcaagggactgagagaggccaagctattaggggatcagtgatatcgatcgtggtg 

ctttccaagaattatgcttcttccagctggtgtttgaaaaacctgctggagattttgaag 

tgcaaggaacaaatcgtgacgacagttttctacggagtagatccatccgatgttaggaaa 

cagactggagatttcgggaaagctttcaaggaaacatgtcgtggtagcaaagatggagac 
aagctttaa 

(SEQ ID NO: 38) 

9 ene complement (join (122358 . . 122363 , 124018 . . 124089, 

12 6272. .126370, 131256, . 131351 , 132828 . . 132830 , 
133476. . 133515) ) 
/gene="Peptide_20 " 

/note="GenScan ; Pl=Prom, P2=0.712, P3=0.749, P4=0.080, 
P5=0.083, P6=PlyA» 
peptide_20 | 89__aa 

MIRDFAADKVALRASPTRNFILRANDPQNEGLKSSKIjVVKLRYPARYPTSSASKSHKRTK 
THL I KLTQRQRS ALKK I QRKQL FKAS S FD 
(SEQ ID NO:39) 

CDS CDS_2 0 | 2 7 0__bp 

atgatacgtgattttgctgccgataaagttgctctaagagcatctccaacaaggaacttc 
attttaagggccaatgatcctcaaaatgagggtttgaagagctctaaacttgtcgtcaaa 
ctcagatatccggcgagatacccgacatcatcggcgagcaaatctcacaaaagaacaaaa 



68 



WO 03/006622 



PCT/US02/222J7 



acccatcttattaaattgacgcagcggcagcgttcggcgt taaaaaaaatacaaaggaaa 
cagctttttaaggcatcctcgtttgattga 
(SEQ ID NO: 40) 

gene join (134673 . .13 4712, 13 6050 . .13 6161, 136634 . . 13 673 9, 

13 6 919 . . 13 7191 , 13 8414 . . 13 8510 , 138 601 . . 13 8 672, 
13 904 8 . .13 98 09, 13 9955 . . 14047 0 , 1412 13 . .141218) 
/gene= " Pept ide_21 " 

/note="GenScan; Pl=Prom, P2=0.008, P3=0.006, P4=0.279, 
P5=0.999, P6=0.913, P7=0.935, P8=0.697, P9=PlyA" 
peptide_21 | 64 5_aa 

MRRRERRASGGSGGNGGGATGERTSTGGYSYGSVSAGENR^PENLKKHLAVSVRNIQWS 
YGIFWSVSASQPGLLEWGDGYYNGDIKTRKTVQASEVKADQLGLERSEQLRELYESLSLA 
ESSTSCGSQyNRRASASSLSPEDLTDTEWYYLVCMSFVFNIGEGVPGGVIiANGQPIWIjCN 

AHTADSK^FTRSIiIiAKSASLLTWCFPFLGGVIjEIGTTEHVAENL,NVIQCVKTLFLEAPH 

GTIiSTRSDYQEIFDPLSHDKYI PVFGTEAFPTTSTSVYEQEPDDHDSFINGGGASQVQSW 

QFVGEELSNCVHQPLNSSDCVSQTFVGTTGRVTCGPRKSRNQRLDQIQEQNNRVNMDDDV 
HYQGVISTIFKTTHQL.VLGPQFQNFDKRSSFTRWRRSSLSAKTLGEKSQNMLKKI ISEVP 
RMHQKKALLPDTPEDSGFKVGDETANHAIjSERKRREKLNDRFITLRSMIPSISKTDKVSI 
LDDTIEYLQELQRRVQELESCRESDGKEMRMAMKRKKMEDEDERVSANCLKSKRKESESD 
VNVEEDEPADTGYAGLTDNLRI GS FGNEWI ELRCAWREG I LLEI MDVI SDLNLDSHS VQ 
SSTGDGLLCLTVNCKVQLLTQAKFLLPNRNLSKPKIiIYILMLIAA 
(SEQ ID NO: 41) 

CDS CDS_2 1 | 1 9 3 8_bp 

atgcgaagaagagaaaggagagcgtctggagggtctggaggtaacggcggaggagcgacg 
ggagagagaacttcaaccggaggatactcgtacggcagcgtatctgcaggagaaaacaga 
atgatgccggaaaatctaaagaagcacctcgctgtttcagttcgaaacattcaatggagt 
tacggaatcttttggtctgtct ctgctt ctcaaccaggactgttggagtggggagatgga 
tactacaatggagacattaagactaggaagacggttcaagcatcggaagtcaaagctgac 
cagttgggtcttgagagaagtgagcagcttagagagctttacgaatctctctccctagcg 
gagtcttcaacctcctgtggttctcaggtcaatagacgggcttccgcctcctctttgtct 
ccggaagatctcaccgacactgagtggtattacttagtatgcatgtctttcgtcttcaac 
attggtgaaggagttcccggaggagtgttggcgaacgggcaaccaatatggttatgtaac 
gctcataccgccgatagtaaagtcttcactcgctctcttcttgctaaaagtgcttcgctt 
ttgacagtggtttgcttcccatttcttggaggagttcttgagatcggcacgaccgaacat 
gttgcagagaacttaaacgtgatacaatgcgtgaagacattgttccttgaagctcctcat 
ggaactttatcaacgagatccgattatcaagaaattttcgatcctttaagccacgataaa 
tacattccagtgtttggaactgaagcttttccgacaacttccacaagcgtgtatgagcaa 
gaaccagatgatcatgattcgttcatcaacggtggtggtgcatcccaggtacaaagctgg 
cagtttgtgggtgaagaactcagtaattgcgttcaccaaccgcttaattctagcgattgc 
g 1 1 1 cccagac 1 1 1 1 gt t ggaacaaccgggagag 1 1 act t gcggt ccaaggaagagt agg 
aatcaacggttagatcagattcaagaacagaataaccgagtaaatatggacgacgatgtt 
cattaccaaggcgtgatctcgacgattttcaaaacaacgcatcagctagttcttggaccg 
cagtttcagaactttgataagcggtctagtttcacacggtggaggaggtcatcattgtct 
gcaaaaacgttgggagagaagtcgcaaaatatgttaaagaagattatatccgaggttcct 
cggatgcaccaaaagaaggcgttgttaccagacacaccagaagatagcgggtttaaggtt 
9999 at 9 aaacc 9cgaaccacgccttgtccgagaggaaacgccgcgagaaattgaatgat 
cggttcataacgttgagatcaatgattccttcaattagtaagaccgataaggtgtcgatt 
cttgatgatacgatcgagtatcttcaagaacttcaaagacgggttcaagaattggaatct 
tgcagagaatctgacggtaaagaaatgcgaatggctatgaaaaggaagaaaatggaggat 
gaagatgaaagagtatcggcgaattgtttgaaaagcaagaggaaggagagtgagagtgat 
gtgaatgttgaagaagatgaaccggctgataccggttatgctggtctaacagataattjta 
aggatcggttcgtttggcaatgaggtggttattgagcttaggtgtgcttggagagaaggg 
at at tgcttgagataatggatgtcattagtgatctcaatttggactctcactcggtacag 
tct tcaaccggggatgggttattgtgtttaactgtcaattgcaaggtacagctactaaca 
caagctaaatttctgttacctaaccggaatttgagtaaaccgaaactgatatatatttta 
atgctgattgcagcataa 
(SEQ ID NO: 42) 

gene complement (join (142728 . . 142733,143177. . 143658, 
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144 696 . . 14 522 0 , 14 53 03 14 53 99, 14 5477 . . 14 64 72 , 
14 6553 . . 14 662 0 , 14 670 0 . . 14 688 9 , 147267 14 7 3 06) ) 
/gene= " Pept ide_22 " 

/note="GenScan; Pl=Prom, P2=0.971, P3=0.999, P4=0.739, 
P5=0.946, P6=0.078, P7=0.105, P8 = PlyA" 
peptide_22 | 785_aa 

MEKRRS PRSSKS PGTPI FPKSPLVYES YKSGCGWKXjINFFDFRHVKSGNKRLSSQKKPI R 

DSAGNVYTKSQLDLLKRLHERCQCHDRIVEGENSCKPKTRRRSFSSEREDESYESKPVQG 

LLEREIKRIKNAKEETSLSEVEKAKQMDLKNGRDCKKKSSEINLQVCWEAAETLISSKA 

EEKGKDRSKQFMEAIiDILSSNKELFITLLQDPNSFSAKKGQDLERPHVKERRDKSPSLAD 

DLDEIVLLKPRLPSLVDDRKYLRFKHLTKKljKLVVGSNKDTEASGDVSSSAVGYRSPESP 

VFRRKKRVESDVFKLSIEKDVSPRRFTVERQQERSDSSPVYEVPKALSSLQTKLKERRER 

LEKRRESFKLWSLDKEDLEVFDPNPHSYNVRSLSEKTSLRAPVEDGLEEDRYLESSSAES 

SiraQEQEQPSPVSVLERIHMLDETVGLRNKEQIGLLSFDLVEraSVHEE^KQLLQASRL 
NWTNLMARCNEETSL.LDEFSHGN™ 

WNVPAS FREEDL IHETLRRFDWSLiLCCDS PKALDQVVEADLI KPSCIiWLDCGGEAEGVV — 

SDWENIMQGLWEISHELRTMQIRLQVSHRFCWYGIVPRGPLDLSTAPDKTRLHGEAI 

DFIT^QDVHKLAQSHLESATTKYKliAADTKRCELIFEPGDLVWVYLiTK^RLP 

KSKKiGPVEWERINPNVYRVRLPSHLRTSDVFNIKHLSPFKGDNDDPDSWANPSQPGGP 
DAAAS 

(SEQ ID NO: 43) 

CDS CDS_2 2 | 2 3 5 8_bp 

atggagaaaaggagatcaccaagaagcagcaagagtcctggaactccaattttccctaaa 

agccctcttgtctacgagagttacaagtctggttgtggttggaaactgattaacttcttc 

gatttcagacatgttaaatctggtaataagagactgagttcacagaagaaacccatcaga 

gactctgctggtaatgtttatactaaaagccaacttgatttgctcaagagacttcatgag 

agatgtcagtgtcatgatcggattgtggaaggagagaactcatgtaagcctaaaacgagg 

agaagatcctttagttctgaaagagaagacgagagctatgaatcaaaacctgttcagggt 

ttactagagagagagataaagaggatcaagaatgcaaaggaggaaacttctttgagtgaa 

gttgagaaggctaagcaaatggatctgaagaatgggagagattgtaaaaagaagagctca 

gagattaaccttcaagtttgtgtgaatgaagctgctgaaacattgatcagttccaaggca 

gaggagaaaggaaaagaccggtccaagcagttcatggaagcattagatattctaagctcg 

aacaaagagttgttcattacactcttacaagatcctaactcgttctcagccaaaaaaggt 

caagacttggagagacctcatgtgaaggaacggcgtgataaatctccttcactggctgat 

gatttggatgagatcgttctcttgaagcctagactaccaagcttggttgacgacagaaag 

tacttgagattcaaacatctaaccaagaagttgaagcttgttgttggatctaacaaagac 

accgaggcttctggcgatgtttctagcagtgcagttggctatagaagcccggagt caeca 

gtgtttaggcgcaagaaacgtgttgagtctgatgtcttcaagctaagcattgagaaggat 

gtttcgccaaggaggttcacagtggaaaggcaacaggagagatcagattcttcaccggtt 

tatgaagttcctaaagcactgagtagcttacagacaaaactcaaagagagaagagagagg 

ctggagaagagaagagagagcttcaagttgtggtccttggacaaggaggacttggaggtt 

ttcgatccaaatccgcacagctacaatgtaaggtctctcagtgagaaaacaagcctgaga 

gctcctgtagaagacggactagaggaagatagatatctagagagctcatcagctgagtcg 

agtataaaaagacaggaacaagagcaaccgagtcctgtctctgtactagagaggattcat 

atgttagatgaaacagttggcctcagaaacaaagagcagattggactgttatcttttgat 

cttgtcgagaaagactctgttcacgagtttgtgaagcaacttctccaagcttcaagactg 

aactggaccaatcttatggcgagatgcaacgaagaaacatcactactagacgaattctca 

cacggtaatcacaacaacgaccagcttcttcttgttcttgactacacagacgaaattctc ' 

cgcgagatttaccgtcaagacatcaagttttggcctttcaagccgtctcagagctcgaga 

gtcgtcaatgtaccagcttcctttagagaagaagatctgatacatgagacgttgagacgt 

tttgactggagtttactctgctgtgactctccaaaagcattggatcaggtcgttgaggct 

gatctgataaagccatcatgtctttggttggattgtggtggtgaagctgaaggtgtagtc 

tctgatgtcgttgaaaatattatgcaagggttggtggttgagatatctcatgagctaaga 

acaatgcagattaggttacaggtttctcaccgtttttgtgttgtgtatggtattgttcct 

cgtggtccgttggatctttcgactgcaccagacaaaactcgccttcatggagaggccatt 

gactttattaccaaccttcaagacgtccataagctcgcgcagtctcatttggaatcggct 

acgactaaatacaaacttgcagcagacaccaagcgttgtgagctgatcttcgaaccaggt 

gatctcgtttgggtgtacttaactaaagaacgactccccttgcgtgattacaataaattg 

aagtctaaaaagcttggtccagtcgaagtggtggaacgcatcaatccgaatgtgtatcgt 

gtcaggttgccttcacaccttcgtacgtctgacgtcttcaacatcaagcatttgtctccg 
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ttcaaaggcgacaatgatgatccagattcgtgggcaaatccttctcaaccgggaggacct 

gatgcagcagcatcataa 
(SEQ ID NO:44) 

gene join ( 147355 . .147394,147557. .147628,149555. .149586, 

15 0 051. .150111,15593 3. .157925,158 007. .158123, 
15 8 202 . .158377, 158531 . . 158676, 158813 . . 158831 , 
159102 . . 159107) 
/gene= " Pept ide_23 " 

/note= M GenScan; Pl=Prom, P2=0.591, P3=0.063, P4=0.049, 
P5=0.902, P6=0.998, P7=0.911, P8=0.987, P9=0.974, 
P10=PlyA" 
peptide_2 3 | 871_aa 

MGPLTIKEAFSLVLEiCEGEEEEDEIFRILGSSDRRNIDTIRIFEGPRFGSGPDISWAILV 
GLVXXXPNPPRITNTRNAPYCRLATKKNSYPLRKTFLHHTPSPVSPMAFTFFSPHPVFLS 
LGRTTSSFSSYKPAYSPFSRNPRNNLQLiAAGPTRRSSYPNPADDDPPEAPEDSMHGVSKF 
QQIQRQAARARKLEEEDFEKNRNTYLSAIADVEDAPETGRDDVESGGDLFSDIDRAISMK 
RSEFVKKGLIjQPNPPKTASSKKIDEEEEEDDAVDEIiDEEEAVDLDEIDKLTGLTEASDEE 
DWVDEEGNPRIISKKKEHQFEFDLDDFGESKARIVEPKFRLSLAELLDESKVVPISVYGD 
LDVE ITG I QHDS RGVS AGDLF VCCDGGDDS VLS EADKRGAVAWAS KE I D I EDTLGCRAL 
VI VEDTEAVIiAALAS S FYRHPS KDMAVI GVTGTNGKTTTT YL I KSLYEAMGVRTGMFS S V 
SCYVHGDNKMDSTTTSPDAVLVQSMI^KMIjHNGTEAIjVMEASPQEIjASGKCDEVDFDIAV 
FTNIiAREDSGFRGTDEEYRDAEAKLFAR^r^7DPERHRKVVNIDDPNAAFFVQQGNPDVPVV 
TFAMENTKADVHPLKFELSLFETQVIiLm , PQGILEISSGLLGRHNIYNI3^AAVAVGIAVG 
APLEDIVRGVEEVDAVPGRCELIDEEQAFGVIVDHANTPDGLSRLLDSVRELKPRRIITV 
IGCAGETERGKRPVMTKIATEKSDVTMLTSDNPGNEDPLDILDDMLAGIGWTMQEYLKHG 
EHDYYPPLANGHRLFLHDIRRVAVRCAVAMGEEGDMVVVAGKGHEAYQLEGDKKEFYDDR 
EECREALQYVDELHQAGIDTSEFPWRLPESH 
(SEQ ID NO: 45) 

CDS CDS_23 | 2616_bp 

atgggtcctctcaccattaaagaggcttttagtctcgtcctggagaaggagggagaggag 
gaggaagatgaaatatttcgaattttaggtagttcggatagaagaaatatagacaccata 
cggatatttgaaggtcctcggttcggttcgggtccggatatttcgtgggctattttggtg 
ggcttggtcattattacgccaaacccacctcgtatcactaacactcgaaacgcaccgtat 
tgtcgtttggcaacaaaaaaaaactcttatccccttcgaaaaacctttcttcatcacact 
ccctctcctgtctctccaatggccttcactttcttctctcctcaccccgtcttcctctct 
ctaggtcgaaccacctcttccttctcctcctacaaaccggcatactcaccattctcccga 
aatccccgtaataatcttcagttagcggcgggccccacccgtcgtagctcttacccgaac 
ccggcggatgacgacccgcccgaagccccggaggattcgatgcacggcgtctccaagttt 
cagcagatacagcgccaagccgctcgagcgcggaagctggaggaagaagacttcgagaag 
aaccgaaacacgtacctctcggccatcgctgacgtggaagacgcgcccgagacgggacgc 
gatgacgtggagtctggaggcgatctgttctcggacatcgatagagccatctcaatgaaa 
cgtagcgagttcgttaaaaaaggactgctccaacctaaccctcccaaaaGggcgtcgtct 

. aagaagatcgacgaggaagaagaagaagacgacgctgttgatgagcttgacgaagaagaa 

gctgtggatttagacgagatcgataaactgactggattaaccgaagcttccgacgaagaa 
gactgggtcgacgaggaagggaaccctaggatcatcagcaagaagaaggagcatcaattc 
gagttcgatttggatgatttcggcgaatccaaggcgagaatcgtggagcctaagttcaga 
ctgagcttagccgagctcttggacgagagcaaagtggtgccgatctcagtctacggcgac 
ttagacgtcgagatcaccggaatccagcacgattcgcgaggcgtaagcgccggagatctc 
ttcgtgtgctgcgacggaggagacgactccgtcctgagcgaagctgacaagagaggagcg 
gtggcggttgtggctagcaaagagatcgatattgaagatacgttaggctgtagagcgctc 
gtcatcgtcgaagacaccgaagcagtcttggctgcgttagcttcttcgttttataggcat 
ccgtcgaaggacatggcggttatcggagtcacggggactaacgggaagacgaccaccacg 
tatttgattaaaagcctctatgaagctatgggtgtgagaacaggaatgttcagcagtgtt 
tcttgttacgtccatggagataacaagatggattcaacgacgacgagtcctgatgctgtt 
ttggttcagagtatgatggcgaagatgttgcataatggaaccgaagctctggttatggaa 
gcttctcctcaagaactcgcttcagggaaatgcgacgaagttgatttcgacattgcggtc 
ttcacgaatttagccagagaggatagtggctttcgcggtactgatgaggagtatagagat 
gctgaagccaagttgtttgcaagaatggtcgacccggaaagacacaggaaagtggttaac 
attgacgatccaaacgcagcgt ttttcgtccagcaagggaaccctgatgttcctgttgtg 



71 



WO 03/006622 



PCT/US02/22217 



acgtttgcaatggagaacacgaaagcggatgttcacccgttgaagtttgagctgtctttg 
tttgagacacaggttttgctcaatacacctcagggtatactggaaatttcgtctggtttg 
ttaggacggcataacatttataacattcttgctgctgttgctgtcgggatagctgtcgga 
gctcctcttgaggatattgttagaggtgttgaggaagtcgatgctgtcccggggaggtgt 
gagttgattgatgaggaacaagcttttggtgttattgtggatcatgctaacacacctgat 
ggtttgtcaaggctgcttgattcggttcgagagcttaagccaagaagaatcattactgtt 
attggctgtgcgggtgagactgagagagggaaacgaccggttatgacgaaaatcgcaact 
gaaaagagtgatgtgacgatgttgacatctgataatccggggaatgaagatccattggac 
atattggatgacatgttggctgggattggatggacgatgcaagagtatctgaaacacgga 
gaacatgattactatcctccattggcaaatggtcatagactcttccttcacgacattaga 
cgtgtagctgtgcgttgtgctgttgcgatgggtgaagaaggtgacatggttgtggtagca 
gggaaaggccacgaagcgtatcagcttgaaggtgataagaaagagttctatgatgatcga 

gaggaatgtcgggaagcattacaatacgttgatgagcttcatcaagctggaatagacaca 
agcgagttcccatggaggttaccagagagt cat t aa 
(SEQ ID~NO:46) 



gene complement (join (159434 . . 15943 9, 15 968 5 . . 159707, 

15983 9. .161876,161980. .162 019) ) 
/gene="Peptide__24 " 

/note="GenScan; Pl=Prom, P2=0.887, P3=0.985, P4=PlyA M 
peptide_24 | 68 6_aa 

MLARVCRFESSSSSSVSAARFFCTGSIRHALAEKSRDGESGEAGFRGESLKIiRSGSYEIK 
GLEDAIDIiFSDMLRSRPLPSVIDFNKLMGAWRMERPDLVISLYQKMERKQIRCDIYSFT 
ILIKCFCSCSKLPFAIiSTFGKIjTKIiGLHPDVVTFTTLLHGLCLDHRVSEAliDLFHQICRP 
DVLTFTTIjMNGLCREGRVVEAVALLDRMVENGLQPDQITYGTFVDGMCKMGDTVSALNLL 
RKMEEI SHI KPNWI YSAI IDGLCKDGRHSDSHNLF I EMQDKGI FPNI VTYNCMIGGFC I 
SGRWSAAQRLLQEMLERKI S PNWT YNALI NAFVKEGKFFEAAELYDEMLPRGI I PNT I T 
YNSMIDGFCKQDRLDAAEDMFYLMATKGCSPDVFTFTTLIDGYCGAKRIDDGMELLHEMP 
RRGLVANT VT YNTLI HGFCLVGDLNAALDLSQQM I S SGVC PD I VTCNTLLDGLiCDNGKLK 

DALEMFKAMQKSKMDLDASHPFNGVEPDVIiTYNILICGLINEGKFLEAEELYEEMPHRGI 
VPDT I TYSSMI DGLCKQS RLD E ATQMF VSMGS KS FS PNWTFNTL I NG YCKAGR VDDGLE 

LFCEMGRRGIVADAI I YITLIYGFRKVGNINGAX1DIFQEMISSGVYPDTITIRNML1TGFW 

SKEELERAVAMLEDLQMSVGYQLEDE 

(SEQ ID NO: 47) 

CDS CDS_24 |2061Jbp 

atgttggctagggtttgcagattcgagtcttcctcttcgtcttctgtgtctgcggctaga 
tttttctgtacgggatcgattcgtcatgctctggccgagaaaagcagggatggagagagt 
ggcgaagcaggttttagaggagagagtttgaaactgcgaagtggatcttatgaaatcaaa 
gggttagaggatgcgattgatttgttcagtgacatgcttcgatctcgtcctttaccttct 
gtgattgatttcaacaagctaatgggtgcggtggtgagaatggaacgcccggatcttgtg 
atttctctctatcaaaagatggaaaggaaacagattcgatgtgatatatacagcttcacc 
attctgataaaatgtttctgcagttgctctaagctcccctttgctttgtctacatttggt 
aagctcaccaagcttggactccaccctgatgttgttaccttcacoaccctgctccacgga 

ttat 9 tctt g atcaca gggtttctgaagccttggatttgtttcatcaaatttgtagacca 

gatgtcctaacgttcaccacgctgatgaatggtctttgccgcgagggtcgagttgtcgaa 

gccgtagctctgcttgatcggatggtggaaaatggtctccagcctgaccagattacttac 

ggaacatttgtagatgggatgtgtaagatgggcgacactgtgtctgcattgaatcttctg 

aggaagatggaggagataagccacatcaaacccaatgtggttatctatagtgccatcatt 

gatggcctttgtaaagatggacgccatagcgattctcataatcttttcattgaaatgcaa 

gacaagggaatctttccaaatatagttacctacaactgtatgatcggtggattttgcatc 

tctggtagatggagtgcagcccagcggttgttgcaagaaatgttagaaaggaagatcagc 

cctaatgttgtaacttataatgctttgatcaatgcatttgtcaaggaaggcaagttcttc 

gaggctgcagaattatacgatgagatgcttccaaggggtatcattcctaatacaatcaca 

tataattcaatgatcgatgggttttgcaaacaggatcgtcttgatgctgctgaggacatg 

ttttatttgatggctaccaagggctgctctccggacgtattcactttcactactctcata 

gacggatattgtggggctaagaggatagatgatggaatggaacttctccatgagatgcct 

agaagaggattagttgctaacacagttacttacaacactcttattcacgggttctgtctg 

gtgggcgatcttaatgctgctctagacctttcacagcagatgatttctagtggtgtgtgc 

cctgatatcgttacttgtaacactttgctggacggtctctgcgataatgggaaactaaaa 

gatgcattggaaatgtttaaggctatgcagaagagtaagatggatcttgatgctagtcac 
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cccttcaatggtgtggaacctgatgttctaacttacaatatattgatctgcggcttgatc 
aatgaagggaagtttttagaggccgaggaattatacgaggagatgccacacagaggtata 
gtcccagatactatcacctatagctcaatgatcgatggactatgcaagcagagccgccta 
gatgaggctacacaaatgtttgtttcgatgggtagcaagagcttctctcccaacgtagtg 
acatttaacacactcattaatggctactgtaaggcagga.agggttgatgatgggctggag 
cttttctgcgagatgggtcgaagagggatagttgctgatgcaattatttacatcactttg 
atttatggttttcgtaaagtgggtaatattaatggggctctagacattttccaggagatg 
atttcaagtggtgtgtatcctgataccattactatccgcaatatgctgactggtttttgg 
agtaaagaggaactagaaagggcagtggcaatgcttgaggatctgcagatgagtgtgggg 

tatcagttggaggatgaatga 
(SEQ ID NO: 48) 

gene join (162193. .162232,163 044. .163 095,164032. .164606, 

165002 . . 165007) 
/gene= " PeptTde_25 " 

_ /note="GenScan/ Pl-Prom, P2=0.976, P3=0.992, P4=PlyA" 

peptide_25 | 208_aa 

MT^GQNQPRTRSYSTLIiAQTNQNPSATECSGCQPPRRSNXjGRIKIRVHILVLiERHASRRP 
AHHHRFTLPPGNHTSVHRNPTGAHKLHTSSSRTLFRSRMALIGESINRRYEISSPWYRT 
HPRERFTCITNLPAGDLHPPPTRPRSTKPKRTTKRRKEKKKMjNPNKLGAESRRRKTVEI 

I T PRRL EPMTVEPKRLPRRGYE GRRRW S 
(SEQ ID NO: 49) 

CDS CDS_25 | 627Jbp 

atgaaccttggccagaatcaacctcgtacccggtcttactcaactcttttagcccaaaca 

aaccaaaaccctagcgctacggaatgctcgggatgtcagccgccgcgtagatctaacctt 

gggagaatcaagatacgtgttcacatcctcgtcctcgagagacacgcgtcacgcaggcca 

gcacaccaccaccgcttcacgctaccaccggggaaccacacctGagttcaccggaaccca 

accggagctcacaaactccacacatcctcatcgcgaacactttttcgatctaggatggct 

ctaatcggagagagcatcaatcgtcgctacgagatctcatcccccgtagtttatcgcaca 

caccctagagaacgcttcacatgcatcaccaaccttcccgccggagaccttcaccctcct 

ccgacgagaccacggtcaaccaaacccaaacgaaccacaaagagaagaaaagaaaaaaaa 

aaaaacctaaaccctaataaactaggagccgaaagccggcggcgcaagacagtggagatc 

atcacaccccggagactagaaccgatgacggtggaaccgaagaggcttccccgtcgcgga 

tacgaaggccggcgtcgatggagctaa 
(SEQ ID NO:50) 

gene complement ( j oin ( 1668 63 . .166 86 8,16707 9. .167197, 

167582 . . 169685, 17 08 95 . .171030, 173614 . .173669, 
173934 . . 173973) ) 
/gene= " Pept ide_26 " 

/note="GenScan; PI = Prom, P2=0.664, P3=0.657, P4=0.831, 
P5=0.314, P6=PlyA" 
peptide_2_6_| 8 04_aa 

MEAPNYPIFFGLNLGVPLEGGRSGTYSGFGSSIiFGFRIFGVKDFSPIRLFLNYGSGSVRI 
LADSSRVFFRDRRRTKFRRNKNKMLARVCGFKCSSSPAESAARLFCTRSIRDTLAKASGE 
SCEAGFGGESLKLQSGFHEIKGLEDAIDLFSDMLRSRPLPSWDFCKIjMGWVRMERPDL 
VISLYQKMERKQIRCDIYSFNIIiIKCFCSCSKLPFALSTFGKITKLGLHPDVVTFTTIjLH 
GLCVEDRVSEALDFFHQMFETTCRPNVVTFTTLMNGLCREGRIVEAVALLDRMMEDGIjQP 
TQITYGTIVDGMCKKGDTVSALNLLRKMEEVSHI I PNVVI YSAI IDSLCKDGRHSDAQNL 
FTEMQEKGIFPDLFTYNSMIVGFCSSGRWSDAEQLLQEMLERKISPDWTYNAIilNAFVK 
EGKFFEAEELYDEMLPRGIIPNTITYSSMIDGFCKQNRLDAAEHMFYLMATKGCSPNLIT 
FNTLIDGYCGAKRIDDGMELIiHEMTETGLVADTTTYNTLIHGFYLVGDLNAALDLLQEMI 
SSGLCPDIVTCDTLLDGLCDNGKIjKDAIjEMFKVMQKSKKDLDASHPFNGVEPDVQTYNIL 
ISGLINEGKFLEAEELYEEMPHRGIVPDTITYSSMIDGLCKQSRLDEATQMFDSMGSKSF 
S PNWTFTTLINGYCKAGRVDDGLELFCEMGRRGI VANAITYITLI CGFRKVGNINGALD 
IFQEMISSGVYPDTITIRNMLTGLWSKEELK^VAMLEK^^ 
GVKRCLGVC P FC S CHHG YRQARS S 
(SEQ ID NO:51) 

CDS CDS_2 6 | 2 4 1 5_bp 
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atggaggcaccaaattatcctatattttttggacttaatcttggtgtacccctagagggt 

gggcgttcgggtacctattcgggtttcggttcgagtctattcggatttcggatttttggg 

gtcaaagattttagccccattcggttatttctaaattacggttcgggttcggttcggatc 

cttgcggattcgtcacgagttttttttagagatcgacgaagaacaaaatttaggcgaaac 

aaaaataaaatgttggctagggtttgtggattcaagtgttcttcttctcctgctgagtct 

gcggctagattgttctgtacgagatcgattcgtgatactctggccaaggcaagcggagag 

agttgcgaagcaggttttggaggagagagtttgaagctgcaaagtgggtttcatgaaatc 

aaaggtttagaggatgcgattgatttgttcagtgacatgcttcgatctcgtcctttacct 

tctgtggttgatttctgtaaattgatgggtgtggtggtgagaatggaacgcccggatctt 

gtgatttctctctatcagaagatggaaaggaaacagattcgatgtgatatatacagcttc 

aatattctgataaaatgtttctgcagctgctctaagctcccctttgctttgtctacattt 

ggtaagatcaccaagcttggactccaccctgatgttgttaccttcaccaccctgctccat 

ggattatgtgtggaagatagggtttctgaagccttggatttttttcatcaaatgtttgaa 

acgacatgtaggcccaatgtcgtaaccttcaccactttgatgaacggtctttgccgcgag 

ggtagaattgtcgaagccgtagctctgcttgatcggatgatggaagatggtctccagcct, 

acccagattacttatggaacaatcgtagatgggatgtgtaagaagggagalfactgtgtct 

gcactgaatctgctgaggaagatggaggaggtgagccacatcatacccaatgttgtaatc 

tatagtgcaatcattgatagcctttgtaaagacggacgtcatagcgatgcacaaaatctt 

ttcactgaaatgcaagagaaaggaatctttcccgatttatttacctacaacagtatgata 

gttggtttttgtagctctggtagatggagcgacgcggagcagttgttgcaagaaatgtta 

gaaaggaagatcagccctgatgttgtaacttataatgctttgatcaatgcatttgtcaag 

gaaggcaagttctttgaggctgaagaattatacgatgagatgcttccaaggggtataatc 

cctaatacaatcacatatagttcaatgatcgatggattttgcaaacagaatcgtcttgat 

gctgctgagcacatgttttatttgatggctaccaagggctgctctcccaacctaatcact 

ttcaatactctcatagacggatattgtggggctaagaggatagatgatggaatggaactt 

ctccatgagatgactgaaacaggattagttgctgacacaactacttacaacactcttatt 

ca cgggttctatctggtgggcgatcttaatgctgctctagaccttttacaagagatgatc 

tctagtggtttgtgccctgatatcgttacttgtgacactttgctggatggtctctgcgat 

aatgggaaactaaaagatgcattggaaatgtttaaggttatgcagaagagtaagaaggat 

cttgatgctagtcaccccttcaatggtgtggaacctgatgttcaaacttacaatatattg 

atcagcggcttgatcaatgaagggaagtttttagaggccgaggaattatacgaggagatg 

ccccacaggggtatagtcccagatactatcacctatagctcaatgatcgatggattatgc 

aagcagagccgcctagatgaggctacacaaatgtttgattcgatgggtagcaagagcttc 

tctccaaacgtagtgacctttactacactcattaatggctactgtaaggcaggaagggtt 

gatgatgggctggagcttttctgcgagatgggtcgaagagggatagttgctaacgcaatt 

acttacatcactttgatttgtggttttcgtaaagtgggtaatattaatggggctctagac 

attttccaggagatgatttcaagtggtgtgtatcctgataccattaccatccgcaatatg 

ctgactggtttatggagtaaagaggaactaaaaagggcagtggcaatgcttgagaaactg 

cagatgagtatggtatattattggtctgaactaaagaggcacaccttccagaagatttca 

ggtgttaaaagatgtttaggtgtctgcccgttctgtagctgtcaccatggttatcgtcaa 
gctcggtcttcatga 

(SEQ ID NO: 52) 

9ene complement ( join (175261 . .175266,176231. . 176253, 

176380. .177545,177619. .178394,180171. .180210) ) 
/gene= " Pept ide__2 7 " 

/note=»GenScan; PI = Prom, P2-0.639, P3=0.247, P4=0.863, 
P5=PlyA" 

peptide__27 | 654_aa 

MLARVYRSGSSSSPAVSAARLFCTRSIRHALAKKSRDGESGFGGESLKLRSGFHEIKGLE 
DAIDLFGDMVRSRPLPSVIDFCKLMGVVWMGRLDWISIjHRKMEMRRVPCNAYSFTILM 
KCFCSCSKLPFALSTFGKITKLGFHPTVVTFSTLLHGLCVEDRISEALDLFHQMCKPNW 
TFTTLMNGLCREGRWEAVALLDRMLEDGLQPNQITYGTIVDGMCKMGDTVSALNLLRim 
EEVSHI KPNWI WPLERRTCMINGFCSSGRWSEAQQLLQEMIjERKKI S PD WTYNAL I NA 
FVKEGKFFEAEELYDEMLPRGIIPSTITYSSMIDGFCKQNRLDAAEHMFYLMATKGCSPD 
IITFNTLIAGYCRAKRVDDGIKLLHEMTEAGLVANTITYTTLIHGFCQVGDLNAAQDLLQ 
EMVSSGVCPNVVTCNTLLDGLCDNGKLKDALEMFKAMQKSKMDIDASHPFNGVEPDVQTY 
NI L I SGIj I NEGKFLE AEEL YEEMPHRG I VPDTI TYNS VI HGLCKQS RLDEATQMFDSMGS 

KSFSPNWTFTTLINGYCKAGRVDDGLELFCEMGRRGIVANAITYITLIHGFRKVGNING 

ALDIFQEMMASGVYPDTITIRNMLTGLWSKEELKJ^AVAMLEDLQMSVGYQLEDE 
(SEQ ID NO: 53) 
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CDS CDS__2 7 | 1 9 6 5__bp 

atgttggctagggtttatagatccggatcttcttcttctcctgctgtgtctgcggctaga 
ttgttctgtacgagatcgattcgtcatgctctggccaagaaaagcagggatggagagagt 
ggttttggaggagagagtttgaagctgcgaagcggatttcacgaaatcaaagggttagaa 
gatgcgattgatttgttcggtgatatggtacgatctcgtcctttaccttcagtaattgat 
ttctgtaaattgatgggagttgtggtgaggatgggaaggctcgatgttgtgatttctctc 
cat aggaagatggaaatgaggcgggttccatgtaacgcatacagctt caeca tcctgatg 
aagtgtttctgcagctgctctaagctgccgtt tgett tgtctacatttggtaagatcacc 
aagcttggttttcatcccactgttgttaccttcagcaccctgctccacggattatgtgtg 
gaagacaggatctctgaagccttggatttgtttcatcaaatgtgtaaaccaaatgtcgta 
accttcaccacgctgatgaacggtctttgccgtgagggtcgagttgtcgaagctgtagct 
ctgcttgatcggatgctagaagatggtctccagcctaaccagattacttatggaacaatc 
gtggatgggatgtgtaagatgggagacactgtgtctgcat tgaatcttctgaggaagatg 
gaggaggtgagccacatcaaacccaatgtggtaatctggcctttggaaagacggarcctgt 
atgattaatggatttrtgtagctctggtagatggagtgaagcccagcagttgttgcaagaa 
atgttagaaaggaagaagatcagccctgatgttgtaact tataatgetttgatcaatgea 
t ttgtcaaggaaggcaaattctttgaggctgaagaattatacgatgagatgcttccaagg 
ggtataatccctagtacaatcacatatagttcaatgatcgatggattttgcaaacagaat 
cgtcttgatgctgctgagcacatgttttatttgatggctaccaagggctgctctccggac 
ataatcactttcaatactctcatagccggatactgtagagctaagagggtagatgatgga 
ataaaacttctccatgagatgactgaagcaggattagttgctaacacaattacttacacc 
actcttattcacgggttctgtcaggtgggcgatcttaatgctgctcaagaccttctacag 
gagatggtctctagtggtgtgtgccctaatgtcgttacttgtaacactttgctggacggt 
ctctgcgataatgggaaactaaaagatgcattggaaatgtttaaggctatgcagaagagt 
aagatggatattgatgctagtcacccctttaatggtgtggaacctgatgttcaaacttac 
aatatattgatcagtggcttgatcaatgaagggaagtttttagaggccgaggaattatac 
gaggagatgccacacagaggtatagtcccagatactatcacctataactcagtgatccat 
ggtttatgcaagcaaagccgcctagatgaggctacacaaatgtttgattcgatgggtagc 
aagagcttctctccaaacgtagtgacctttactacactcattaatggatactgtaaggca 
ggaagggttgatgatgggctggagcttttctgcgagatgggtcgaagagggatagttgct 
aacgcaattacttacatcactttgattcatggt tttcgtaaagtgggtaatattaatggg 
gctctagatattttccaggagatgatggcaagtggtgtgtatcctgataccattactatc 
cgcaatatgctgactggtttatggagtaaagaggaactaaaaagggcagtggcaatgctt 
gaggatctgcagatgagtgtgggatatcagttggaggatgaataa 
(SEQ ID NO: 54) 

complement (join (1805 97 . . 180602 , 18 0714 . . 181634 , 
181896 . . 182 003 , 18216 9 ... 1822 08) ) 
/gene="Peptide_28 " 

/note="GenScan; PI = Prom, P2=0.998, P3=0.959, P4=PlyA" 
peptide_2 8 | 34 2_aa 

MRGGASGNNVLETINAAATAFASSDDRVHHQPSPIHRRKRIGKAALAPEPVPTDSTSNSG 
YRSVMTALPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPSNSHNNNNNSEERPSIFA 
IGPYAHEPQLVSPPVFSTYTTEPSSAPVTPPLDESFYLTTTTPSSPEVPFAQLFNSSSNY 
GVRS PVSNYEFQF YQLPPGS PLAQLI S PS S VMSGSGATS PFPDGLAQFQVSDP PKLLS PG 
KLRCSKSVTTPKEONKIVRPNKPVSFDLDADHFIRCVDKKIiRTTFPEASDQEAAQHSSSG 
SNKEFDFGTTDEIHLTGDDEHRDSTKNSSDWSFPVMQSGTLS 
(SEQ ID NO: 55) 

CDS_28 | 102 9_bp 

atgagaggcggcgcgagtggaaacaacgttttggagactataaacgcagccgctactgcg 
ttcgcttcctctgatgatcgtgttcatcaccaaccttccccgattcatagaagaaaacga 
atcgggaaagctgctcttgctcctgaaccggttcctaccgattccacatccaattccggt 
tatcgttcggttatgacggctcttcctttcatagccccaccttc'ctctccagcttccttc 
ttccaatcagaacctccttccgctacacagtcacctgtagggatcctctcct ttagtcct 
ctaccttctaacagccacaacaacaacaacaacagcgaagaacgtccttcgatcttcgcc 
atcggaccttacgctcacgaacctcagctggtttctcctccggttttctctacttacaca 
accgaaccgtcttcagctccggtcacgccgcctctcgacgagtctttctacttaaccacc 
accacaccgtcttcgcctgaagtccctttcgctcagctctttaactccagcagtaactac 
ggtgtcaggtctccggtgtctaactacgagtttcagttttaccaacttcctcccggtagt 
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ccactcgctcagcttatctcccccagctcggttatgtccggttctggtgcgacttctccg 

tttcctgacggactcgctcagtttcaagtctctgatccaccaaagctgctgagccctggt 

aaactgcgttgctccaagtctgttacaactcctaaagagcagaacaagattgtgagaccg 

aacaaaccggtttcgttGgatcttgatgcggatcatttcattagatgcgttgataagaag 

ctgagaacaacgttccctgaagcgtctgatcaagaagcagctcaacattcctcctccgga 

tccaataaagaattcgatttcggcaccaccgatgagatacatttgaccggtgatgatgag 

catagagattcgaccaagaacagcagcgattggtccttccctgtgatgcaatcaggcaca 
cttagctaa 

(SEQ ID NO: 56) 

gene complement (join (182434 . . 18243 9, 182881 . . 183285 , 

184 053 . . 184 094, 186762 .. 1868 01) ) 
/gene= " Pept ide_2 9 " 

/note="GenScan; Pl=Prom, P2=0.180, P3=0.217, P4=PlyA" 
peptide_29 | 148_aa — 

MKSSHRAKQVEIGFFWREITFFRFWRKITFFRFWRENAFSGFHEICLRFSGFGGKLRFSGF 

GGKITFSGFDEKLCFSGLDGKLRFSIIiVGMYVCRFWREIVFFGFDGKLRFLGFGVKVCFL 
VLARNWVFRFLiVGNCLFP I LAG YC I FRF 

(SEQ ID NO:57) 

CDS CDS_2 9 | 4 4 7_bp 

atgaagtcaagccaccgtgcaaaacaagttgaaattggctttttttggcgagaaattacg 
tttttccggttttggcggaaaattacgtttttccggttttggcgggaaaatgcgttttcc 
ggttttcacgagaaattacgtttttccggttttggcggaaaattacgtttttccggtttt 
ggcgggaaaattacgttttccggttttgacgagaaattgtgtttttccggtttggacggg 
aaattgcgtttttcgattttggtgggaaattatgtttgtcggttttggcgggaaattgtg 
tttttcggttttgacggaaaattgcgttttctcggttttggcgtgaaagtgtgttttttg 
gtcttggcgagaaattgggttttccgatttttggtgggaaattgcctttttccgattttg 
gcgggatattgcatttttcgattttga 

(SEQ ID NO: 58) 

9 en e join (187074. .187113,187493. .187526, 188534. .18 8733, 

18 93 97. .18 9500, 189760. .1902 60,190331. .191063, 
191102. .191473,191583. .191882,192117. .193246, 
193319. .193375,194106. .194333,194405. .195500, 
195582. .195881,196902. .198149,19843 6. .198441) 
/gene= " Pep t ide_3 0 *' 

/note="GenScan; PI = Prom, P2=0.436, P3=0.402, P4=0.107 
P5-0.460, P6-0.572, P7-0.532, P8=0.954, P9=0.471, 
P10=0.400, Pll=0.949, P12=0.995, P13=0.193, P14=0.202 r 
P15=PlyA" 

peptide_3 0 | 2100_aa 

MIRTEPEFQREVFKEEDLQHEQYRQICFLFSTTSQEVEIIECGVQIVRDRSGRKRSDADA 
VNINHNNNCWKTTMKVSTKEMKFPSKEVSPEQEESRTSQKRQQQGSIKSSSFRDFHRYSA 
LMAS S S S S S S PRTWRYRVFTS FHG PDVRKTFLTHLRKQFNCNG I SMFDDQG I ERGHT I AP 

ALTQAIRESRISIWLTKHYASSRWCLDELLGILKCKEEIGQIVMTIFYGVDPSDVRKQT 

GDFGKVFKDTCRRKTEEERRRWSQALTDVGNIAGEHFLNWDKESEMIEKIARDVSNKLNA 

T I S RDFEDMVG I EAHLD KMQS LLHLDDEDGAMFAG I CG PAG I GKTTI ARALHS RLS S S FH 

LTCFMENLRGSCNSGLDEYGIiKLRLQELLLSKI FNQNDMRI YHLGAI PQRMCDQKVLI IL 

DDVDDLQQLEALADETNWFGDGSRIVVTTEDQELLEQHGINNTYYVDLPTDDEARKIFCR 

YAFRRSLTPYGFETLVERTTELCGKLPFGLRVQFYAERKKTTGKIDAVLRVGYDSLHENE 

QTLFLLIAIFFNYQDDGHVKTMLADTNLDVRLGLKTLAYKSLTKISSQGKIVMHKLLQQV 

GRQAVQRQEPWKRRILIDPQEICDVLEPWKRQVLTDTDEIRDVLENDSGSRNLMGVSFDM 

STILHDMDISARAFTSMRNLRFLKVYKTRCDTNVRVHLPEDMEFPPRLRLLiHWEVYPRKF 

LPRTFCTEHLVELYLRDTELEQLWEGTQPLTNLKKMFLGSCLYLKELPDLAKATNLEKLR 

LDRCRSLVEIHSSVGNLHKLESLEVAFCYNLQVVPNLFNLASLESFMMVGCYQLRSLPDI 

STTITELSIPDTLLEEFTEPIRLWSHLQRLDIYGCGENLEQVRSDIAVERIPDCIKDLQR 

LEELTIFCCPKLVSLPELPRSLTLLIVYECDSLETLAPFPLGSEIEALSFPECFRLDREA 

RRVI TQLQSSWVCLPGRNI PAEFHHRVIGNFLAI CSNAYRFKLCAWSPKQVMVEDEDI E 

LLCHILI NGCPMKSP I KS I YNLRI RIQSEHLFI FPSTMLKEDRQLGQYSE ILFKFSTTSQ 

NTEI I KCGVQ I LRDRRS CDS KSEQDDDESLYSS LDYDAPRVDT I NLLKEHKDLISDYFTS 
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FFSLWCLDELLGILKCKEEMGQIWITIFYGVDPSDVRKQTGDFGKVFKETCRRKTEEERR 
RWSQALTDVGNI AGEHFLNWDKESEMI EKI ARDVSNKLNATI SRDFEDMVGI EAHLDEMN 
SLLHLDDEDGAMFVGICGPAGIGKTTIARALHSRLSSTFQHTCFMENLRGSCNSGTDEYG 
LKL.RLQELLLSKI FNQNGVKXiFHLGAI KERLCDLKVLI VLDDVDDLQQLEALADDTNWFG 
DGSRIIVTTEDQEILEQHGISNTYRVDFPTQVDARQIFCRFAFRQLSAPHGFEKLVDRVI 
KLiCSNLPLGLRVMGS S LRRKKVDDWEG I LQRLENS FDQKI DAVLRVG YNSLHKDDQFLFL 
L I ACFFNYKDDDHVKANLVDSNLDVRLGLKNLVYKSL I Q I S AEGTI VMHKLLQQVGREAV 
HLQDPRKRQI LIDSHQI CDVLENTDSDGTSVMGISFDTSTI PNGVYI SAQGFRRMRDLRFL 
SIYETRRDPNVRVHLPEDMSFPPLLRLIiHWEVYPGKCLPHTLRPEHLVELCFVNSMLEQL 
WQGVQPLTNLKKMDLSGSLSLKEVPDLSNATSLKRLNLTGCWSLVEI PSSIGDLHKLiEELi 
EMNLCVSVQVFPTLLNLASLESLRMVGCWQLSKIPDLPTNIKSLWGETMLQEFPESVRL 
WSHLHSLNI YGSVLiTVPLLETTSQEFSIiAAATIERI PDWIKDFNGLRFLYIAGCTKLGSL 
PELPPSLRKLIVDNCESLETVCFPCDTPTTDYLYFPNCFMLCQEAKRVITQQSLRAYFPG 
— KEMPAAEFDDHRSFGSSLTIIRPAICKFRICLVLSPAPDMEEAYEICLLFRIRAKGCPSDD 
— - DMLSLDIiAX-IQGEHLFIFHIEFVEHHEEMVFKFSTSSHEVDVIECGVQVLTDETSRRSNE 

SCSEQVSEDGDDILSDDDKSNEIYEPRVKIFTGYTMFLSLVFTFLLSLISSLILYRF-fcKN 
(SEQ ID NO:59) 

CDS CDS_3 0 | 63 03_bp 

atgataagaacagaacctgaatttcaaagagaagtgtttaaggaagaagacctacagcat 

gaacaatatcgccagatatgtttcctgttcagcaccacatcccaggaagtcgaaattatt 

gaatgtggcgtccagatcgtgagggacagaagtggcagaaaaagaagcgacgccgatgct 

gtgaatataaatcacaacaacaactgt tggaagacgacgatgaaagtct ctacgaaagag 

atgaagtttcct tcaaaagaagtctcgcctgaacaagaagaaagcagaacaagccaaaag 

cgacaacaacaaggatccataaaaagttccagt ttcagagactt teat cgat act ctgct 

ctcatggcttcttcttcctcttcttcctcgcctcgcacatggagataccgcgtcttcacg 

agcttccacggacctgacgtccgcaaaaccttcctcactcacctacgcaagcagtttaac 

tgcaacgggatatcgatgttcgacgat caagggatcgagagaggccacaccatcgcccct 

gctctcacacaagcgatcagagaatcgaggatctctatcgtggtgttaacgaagcactat 

gcttcttccaggtggtgtttggatgagcttttggggattctgaaatgcaaggaagagatc 

gggcagatagtgatgaccatcttctacggagtagatccttccgatgttcggaaacaaacc 

ggagatttcgggaaagtcttcaaggacacatgccgt egtaaaacagaggaagagaggega 

agatggagccaagctttgaccgatgtgggaaacatagctggggaacactttctcaactgg 

gacaaggaatcggagatgattgaaaagattgcgagagatgtctcaaacaagcttaatgct 

acaatctctagggattttgaagacatggttggtattgaagcacacttggataagatgcag 

tctttgttacatttagatgatgaggatggagctatgtttgctggaatctgtggccctgct 

ggcattggtaagactaccattgctagggctctacatagtcgactctctagcagttttcat 

cttacttgttttatggagaatcttcgaggaagctgtaatagtggtctcgacgagtatgga 

t tgaaactgcgtttacaagagctacttctttcaaagatttttaaccagaatgatatgagg 

atataccatttaggtgcgataccgcaaagaatgtgtgaccaaaaagttcttatcattctt 

gatgatgtggacgatctgcagcagcttgaggctctggctgatgaaactaactggtttggt 

gatggaagcaggattgtggtgaccacggaagatcaagagcttttggagcaacatggtatc 

aacaatacatactatgtggatcttccgactgacgacgaggctcgtaagattttttgtaga 

tatgctttcagacggagcttaacaccatatggttttgaaactcttgtcgaaagaacaaca 

gagctttgtggcaaacttCGt-tttggtctccgtgttcaattttacgcggaaagaaagaag 

acgactgggaaaatcgatgcagtacttagagttggatacgacagtttacatgagaacgaa 

caaactctgtttctcctcattgccatcttcttcaactaccaagacgatggtcacgtgaaa 

acaatgctcgctgacactaacttggatgtcagactcggcttgaaaactctcgcttataag 

tctctcacaaaaatat ctagccaaggaaaaatagtgatgcacaagttactacaacaagtg 

ggcagacaagcagttcaaagacaagagccttggaaacgtcggatcttaattgatcctcaa 

gagatctgcgatgttcttgagccttggaaacgccaagtcctaactgataccgatgagatt 

cgcgatgtccttgaaaatgattctggtagtagaaatttgatgggggtatcttttgatatg 

tctacaatcttacacgacatggatattagcgcaagagcttttacaagtatgcgtaatctt 

cgatttctcaaggtctacaaaacaagatgtgatacaaatgttagagtgcatttacccgag 

gacatggagtttccacctcgtctgaggttattacactgggaggtatacccgagaaagttt 

cttcctcgtacattttgtactgaacatcttgtggaactctatttaagagataccgagctc 

gagcaattatgggagggaacccagcccctcacaaatctcaagaagatgtttttgggttcg 

tgcctgtatcttaaggaacttccggatcttgcaaaagctacaaacctagagaaattgagg 

ttggatcggtgcaggagtttggtagagattcattcctctgttggaaaccttcataaacta 

gagagtttggaagtggctttctgttataatctacaggttgttccgaatcttttcaacttg 

gcatctcttgaatcattcatgatggtgggatgctaccaactgaggagtcttccagatatt 
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t ctacgaccatcacagaactctcaatcccagacacactgt tagaagagtttactgaacca 

attaggctctggtctcaccttcagagactcgatatatatggctgtggggaaaatttggag 

caagtgcgaagtgacatagctgttgagagaattccagactgcatcaaagatctccaacgg 

ttagaagaactaactatattttgctgtccaaaacttgtatcactgccagagctccctagg 

tcgctcacattactaatagtatacgaatgtgattcactggagacactagcacctttccct 

ttaggttctgagattgaagctctctcttttcccgaatgcttcagattggatcgagaagca 

a gg a g a gtaattacccagctgcaatcatcatgggtatgcctacctggaagaaatatacct 

gcggagttccatcaccgggttataggaaatttcttggccatatgctcaaatgcataccga 

tttaagctttgtgccgtggtttcccctaaacaggtgatggtggaagatgaagatatagaa 

ttactgtgtcacatactcataaatggttgccccatgaagagccccattaagagcatatat 

aatcttaggattagaatccaatcagaacatctgtttatatttccctccacaatgctcaag 

gaagaccgacagcttggacaatacagtgagatattgttcaaatttagcaccacatctcag 

aacactgaaattattaaatgtggtgtccaaatcttgagggacagaagaagctgtgattct 

aagtcagaacaagacgacgatgaaagtctctacagcagecttgactatgatgcaccaaga 

gtagataccattaacttactaaaggaaeataaagacttgataagtgattatttcacatca 

ttcttttccttgtggtgtttggatgagcttttgggaattctgaaatgcaaggaagagatg 

gggcagatagtgatgaccatcttctatggagtagatccttctgatgttcggaaacaaacc 

ggagatttcgggaaagtcttcaaggaaacatgccgtcgcaaaacagaggaagagaggcga 

agatggagccaagctttgaccgatgtgggaaacattgccggggaacactttctcaactgg 

gacaaggaatcggagatgattgaaaagattgctagagatgtctcaaacaaacttaatgct 

accatctctagggactttgaagacatggttggtattgaagcacacttggatgagatgaac 

tctttgttacatttagatgatgaggatggagctatgtttgttggaatctgtggccctgca 

ggcattggcaagactaccattgctagggctttacatagccgactctccagcactttccag 

catacctgttttatggagaaccttagaggaagctgtaacagtggtactgacgagtatgga 

ttgaagttgcgtttacaagagctacttctttccaagatttttaaccaaaatggtgtaaaa 

ctatttcatttaggtgccataaaggaaaggttatgcgacctaaaagttcttatcgttctt 

gatgatgtggacgatctgcagcagcttgaggctttggctgacgatactaactggtttggt 

gatggaagcaggattatcgtaaccacggaagatcaagagattttggagcaacatggtatc 

agcaatacataccgtgtggatttcccaactcaagtagatgctcgtcagatcttttgtaga 

tttgcttttagacagctctctgcaccccatggttttgaaaaacttgttgacagagtaata 

aagctttgcagcaaccttcctttgggtctccgtgtcatgggctcgtctttacgcagaaag 

aaagtagacgactgggaaggtatactgcagagactagagaatagctttgatcaaaagatt 

gatgcagtacttagagtcggatataacagtttgcataaggatgaccagttcctgtttctc 

ctcattgcatgcttcttcaactacaaagacgatgatcacgtgaaagcaatgctcgttgat 

agtaacttggatgttagactcggcttgaaaaatctcgtgtataaatctctcatacagata 

tccgcagaaggaaccatagtaatgcacaagttattacaacaagtgggtagagaagcggtt 

catttgcaagatcctaggaaacgccaaatcctaatagattctcaccagatttgtgatgtc 

ctcgaaaatgattctgatggtacaagtgtgatgggtatat cctttgatacatccacaatc 

ccaaacggagtgtatataagcgcgcaagggtttagacgaatgcgtgatcttcggtttctc 

agcatctacgagacaagacgtgatcctaatgttagagtgcatttacctgaggacatgagt 

tttccacctcttctaaggttattacactgggaggtatatccaggaaagtgtcttcctcat 

actcttaggcccgaacatcttgtggaactctgttttgttaacagcatgctcgagcagctt 

tggcaaggagtccagcctcttacaaatctcaagaagatggacttgtccgggtcactgagt 

ttgaaggaagtccctgatctttcaaatgctacaagtctcaagagattaaatctaacaggt 

tgctggagtttggtagagattccttcctctattggagaccttcataaactggaggagctg 

gagatgaatttatgtgtaagtgtacaggtttttccgactctactcaacttggcatctctt 

gaatcactcaggatggtaggatgctggcaactaagcaaaattcctgatcttccaaccaac 

atcaaatcacttgtagtcggggaaacaatgctacaagagtttcctgaatcagtcaggctt 

tggtctcaccttcatagtctcaacatatatggcagtgtcctcacagtgccacttttggaa 

accacgtcgcaagaattctccctcgctgctgctacgattgagaggattccagattggatG 

aaagattttaatgggctaaggtttctttacatagctggctgcacgaaacttggatcactg 

ccagagctccctccctcgcttagaaaactaatagtagacaactgtgagtcactagagacg 

gtctgttttccttgcgacactccaacaactgattatctctacttccccaactgcttcafcg 

ttgtgccaagaagcaaagagagtaatcacacagcaat cat tgagagct tact tcccagga 

aaagagatgcctgctgcagagttcgatgatcatcgatcatttggaagttccttgaccatc 

atccgtccggctatctgcaagtttaggatttgcctggtgctttctcctgcaccggatatg 

gaagaagcttatttcaaattactgtttcgcatacgtgcaaaaggttgtcccagtgatgat 

gacatgctttcgttagatctcgctaaaatccaaggggagcatctttttatatttcacatt 

gagtttgttgaacatcacgaggagatggtgttcaaattcagcacctcatcccacgaagtc 

gacgttattgaatgcggtgtacaggtcttgacagatgaaaccagcagaagaagcaatgaa 

tcttgttcagaacaagtgtctgaagacggggatgatattctatcggatgatgataagagc 
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aatgagatt tatgaacccagagtaaagatatttacgggt tatacaatgtttctttct tta 
gtttttacgttccttttgagtttgatttcaagtctaattctgtatagattcctaaagaac 
taa 

(SEQ ID NO: 60) 

gene complement ( join ( 198625 . . 198630,198733 . . 198813, 

198 944 . . 199 015 , 199100 . . 199216 , 19 9253 . . 199414 , 
199503 . . 1998 56 , 200002 . . 2 0014 8 , 20 02 52 . . 20 03 90 , 
20 0469 . . 2 00 617, 2 0 0760 . .2 00 928 , 201066 . . 2 01146 , 
201231. .201340,20156 6. . 2 01577 , 201943 . . 2 02 050 , 
202 075. .2 02155,2 02241. .202304,202431. .202456, 
202555. .202641,202727. .2 02754,2 02867. .202928, 
20 3 018 . .2 03 230,203 316. .203 423,20 3 511. .203 585, 
2 03-94:2-. . 2 04018 , 204139 .. 204214 , 204308 . . 2 0A370, 

2 054 71 . .2 05510) ) 

/gene = " Pept i de__3 1 

/note="GenScan; Pl-Prom, P2=0.989, P3=0.989, P4=0.975, 
P5=0.879, P6=0.985, P7=0.993, P8=0.437, P9=0.433, 
P10=0.990, Pll=0.922, P12=0.859, P13=0.999, P14=0.421, 
P15=0.824, P16=0.996, P17-0.683, P18=0.979, P19=0.983, 
P20=0.999, P21=0.994, P22 = 0 . 934 , P23=0.729, P24=0.737, 
P25=0.990, P26=0.966, P27=PlyA" 
peptide_31 | 88 6_aa . 

MDAPKE I FLKDYTKPDYYFETVDLS FSLGEEKT I VS SKI KVS PRVKGSS APLVIjNGHDIjK 
LliSVKVEGKLLKGLYKSSGNFCTQCEAEGFRKITFYQDRPDIMAKYTCRVEADKSLYPVL 
LSNGNLISQGDVEGGRHFALWEDPFKKPCYLFALVAGQIiASRDDTFTTRSGREVSLKIWT 
PAEDLPKTAHAMYSLKAAMKWDEDVFGLEYDLDLFNIVAVPDFNMGAMENKSLNIFNSKL 
VLASPETATDADYAAILGVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMG 
SRTVIO^IADVSKIjRIYQFPQVTTWVYLVFSKPILCCLIIQDAGPMAHPWPHSYIK^EK 
GIDLYFQRHDEQAVTCEDFFAAMRDANNADFANFLQWYSQAGTPVVKVASSYNAEARTFS 
LKFSQEIPPTPGQPTKEPTFIPVWGLIiDSSGKDITLSSVYHNGTLQTISSSSTILRVTK 
KEEEFVFSDISEKPVPSLFRGFSAPVRVETDLSDDDLFFLLAHDSDEFNRWEAGQVLARK 
LMLNLVSDFQQNKPLVLNPKFIQGLGSVLSDSSLDKEFIAKAITLPGEGEIMDMMAVADP 
DAVHAVRKFVRKQLASELKTELLKIVEI^RSTEAWFDHPNMARRAIiKNTALGWHLRVII 
KIjNESIjFVRKSLTSLCVCFVEIMPAYIjASLEDPAYVEIiALGEYKSATNLTDQIAAIiAALiA 
QKPGQTRDEVLADFYNKWQGDYLVVNKWFLLQASSDIPGNVENVKKIjLDHPAFD 
ASEFSKLLFVPQLSFSDVYSLIGGFCGSPVNFHAKDGSGYKFLGDIWQLDKINPQVASR 
MVSAFSRWKRYDETRQALAKAQLEMIMSANGLSENVFEIASKSLAA 
(SEQ ID NO:61) 

CDS CDS_3 1 | 2 6 6 l_bp 

atggatgcacctaaggaaatctttctcaaggactacaccaagcctgattactactttgaa 
actgtggatctaagcttctctctaggtgaagagaagacaattgttagctccaagatGaaa 
gtttcccctcgagttaaaggatcttctgc.tcccttggtettgaatgggcatgacttgaag 
ctactctctgtcaaggttgaggggaagcttctaaaggggctctacaagtcatctgggaat 
ttttgcacacaatgtgaagcagagggtttccggaaaatcacattttaccaggaccgtcct 
gatattatggcgaagtacacatgccgtgttgaagctgacaagtcactctatcctgtactg 
ttgtccaacggaaacctcatttctcaaggagatgtagagggaggtcggcactttgcctta 
•tgggaggatcctttcaagaagccatgctatctatttgctctggtggctggacagctagcg 
agcagagatgatacatttactacacgctctggtagggaggtatctctgaaaatctggact 
cctgcagaagatctaccaaagactgctcatgccatgtattctctgaaggcggccatgaag 
tgggatgaagatgtgtttggccttgagtatgacctggatctcttcaacattgtcgccgtt 
ccagattttaacatgggagccatggaaaacaagagtttgaacatttttaattccaagcfet 
gtcctggcatctccagaaactgcaacagatgcagattatgctgcaattttgggagttatt 
ggtcatgaatacttccacaattggacaggcaacagggtgacatgccgtgactggttccaa 
ctcagtctaaaggaaggtctaactgtcttccgtgatcaggagttttcatctgatatggga 
agccgtactgtaaagcgtattgctgatgtttcaaagctcaggatctatcaattcccgcag 
gtcactacatgggtttatctcgttttttctaagccaatcctctgttgtttaattatccag 
gatgctggtcctatggcacatcctgttcgcccacattcatacatcaaggtttatgaaaag 
ggtattgatctttattttcaaagacatgatgagcaagctgtgacctgtgaagacttcttt 
gctgctatgcgtgatgcaaacaatgcagattttgctaatttcttgcaatggtactctcaa 
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gctggaacgccagtcgtcaaagtggcatcctcttacaatgctgaagctcgtaccttctct 
ttgaaattcagtcaggagatacccccgactccgggccagccaacaaaagaaccaacattt 
attccagtggttgttggtcttctggactcaagtggaaaagacattacactttcctctgtt 
taccataatggtacacttcagaccatttcaagcagcagcacaatccttcgagtgaccaag 
aaagaagaagagtttgtgttctctgatatatcagaaaaacctgtgccgtctctgtttaga 
ggattcagtgccccagttcgtgttgagactgatctctctgatgatgatctgttcttcctc 
ttagcccatgattcagatgaattcaatcggtgggaggcaggtcaagttctggcaaggaag 
ctcatgctgaacttagtttctgatttccaacaaaacaaaccattggttctaaacccaaag 
ttcattcagggtctcggcagcgtgctttctgactcgagcttggacaaggaatttatagcc 
aaggcaataacattacctggggagggagagattatggacatgatggccgtggcggatcct 
gatgctgttcatgccgttagaaagtttgttagaaagcagcttgcatctgaacttaaaact 
gagcttctaaagatagttgagaacaataggagcactgaggcttatgtctttgaccacccc 
aatatggctaggcgtgctttgaagaacactgctctaggttggcatttgcgagtcattatt 

aagttaaacgagtctctctttgtgcggaagtcactaacttctttgtgtgtttgctttgtt 

gaaataatgccagcttatcttgcatcgcttgaggatccagcatacgtggaacttgcattg 

ggtgaataGaagtcggccaccaatttgacagaccaaattgctgctttggcagctcttgca 
caaaagccgggtcaaacccgtgatgaggttcttgctgatttctataacaagtggcagggc 
gattacttggttgttaataaatggttcctccttcaagcatcatccgacattcccgggaat 
gtggagaatgtcaagaagcttttggatcacccagcttttgatctgcgcaatccaaacaag 
gcaagtgaattctcaaaactcttattcgttccgcaactttctttctcagatgtttactcg 
ctcattggagggttctgcggttcgccagtgaatttccatgcaaaagatggatcaggttac 
aagttcttgggtgacattgttgtccagttagacaaaatcaatcctcaggttgcttctcgt 
atggtatctgcgttttcgaggtggaagcgctacgatgaaacccgacaagctctggccaag 

gcacagttggagatgataatgtcagctaatgggttatctgaaaatgtatttgagattgcc 
tctaagagcttggctgcttga 
(SEQ ID NO: 62) 

gene complement (join (20 5579. .2 05584, 20 62 16. .206365, 

206457. .206579,206721. .206831,20 7148. .207282, 
,207361. .207513,207594. .207668,207765. .207842, 
207943. .208007,208105. .208216,208295. .208405, 
208 557 . . 2 08655 , 20872 6 . . 208765) ) 
/gene = " Pept i de_3 2 " 

/note="GenScan; Pl=Prom, P2=0.971, P3=0.989, P4=0.872, 
P5-0.978, P6=0.976, P7=0.972, P8=0.999, P9=0.936, 
P10-0.941, Pll=0.981, P12=0.999, P13=PlyA" 
peptide__32 | 4 03_aa 

MRGLVKKiVSRSLSVSGKWQHQQIiRRLNIHEYQGAELMGKYGVNVPNGVAVSSLDEVKNA 
IQQVFPNETELWKSQILAGGRGLGTFKSGLQGGVHIVKRDQVQDIAGKMLGQVLVTKQT 
GPQGKVVSKVYLCEKLSLVNEMYFSI ILDRKSAGPLI I ACKKGGTS I EDLAEKF PDMI IK 
VP I DVFAG I TDEDAAKWDGLAPKAADRKDS I EQVKKLYELFRKSDCTMLE I NPLAETST 
NQL VAADAKLN FDDNAAF RQKE I FALRD PTQED PRE VAAA KVDLNYI GLDG E I GCMVNGA 

GI^QPWTSLNCTWEAFKILTSDDKVKAILVNIFGGIMKCDVIASGIVNAAKEVSLK^P 
WVRLEGTNVEQGKR I LKESGMKL I TADDLDDAAEKAVKALAN 
(SEQ ID NO:63) 

CDS CDS_3 2 | 1 2 1 2_bp 

atgaggggattggtgaagaagctcgtgtccagatctctctccgtctccggcaaatggcag 
caccaacagcttcgccgtctcaacatccacgagtatcagggagcagagctgatgggtaaa 
tacggagtaaacgtgcccaacggagtcgctgtctcttccctcgatgaagtcaaaaacgct 
atccaacaagttttccctaatgaaaccgagttggtcgttaagagccagatcttggctggt 
ggaagaggtctcgggactttcaagagtggtcttcaaggtggtgttcacattgtcaaacgt 
gatcaggttcaagatattgctggcaagatgcttgggcaagttctcgtcaccaaacaaaGt 
ggtcctcaaggcaaagtagtcagcaaggtctacttgtgtgagaagttgtcactcgtcaat 
gagatgtacttttccattattctcgaccgtaaatctgctggaccgcttataattgcctgt 
aaaaagggtggtaccagcattgaagatcttgctgagaagttccctgacatgattattaag 
gtaccgatcgatgtatttgcaggtattacagatgaggatgctgccaaggttgtggatggt 
ctggctccaaaagctgctgacagaaaagattcgattgaacaagtgaagaagctttacgaa 
ctcttccgcaagagtgactgcactatgttggaaatcaaccctctcgctgagacatccacg 
aaccaattggtagctgctgatgctaagttgaactttgatgataatgctgctttccgtcag 
aaagagatttttgcccttcgtgatccaacgcaggaggatccacgagaggtggctgctgca 
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aaagtggacctgaactatatcggtttagatggagagattggctgcatggtgaatggtgct 
ggattggccatgcaaccatggacatcattaaactgcacggtggtggaggcgtttaagata 
ctgacatcggacgataaagtgaaagcaatat tggtgaacatattcggtgggataatgaaa 
tgtgatgtgattgctagtggaattgtgaatgctgctaaagaggtgtcactgaaagtacca 
gtggtggttcgtctggaaggaacaaatgttgaacaaggaaagagaatcctcaaggaaagt 
ggaatgaaactcataacagctgatgatttggatgacgcagcagagaaagctgtgaaggca 
ttagctaattag 
(SEQ ID NO: 64) 

gene complement ( join (208954 . .208959,209570. .209769, 

210 814 . .210874,211045. .211084) ) 
/gene="Peptide_3 3 " 

/note="GenScan; Pl=Prom, P2=0.592, P3=0.647, P4=PlyA M 

peptide_3 3 | 8 6_aa 

MEVFRNHKKLIFTKPESFEKIAPKLKFLGLLAARIETECEGASEIGLTRGLRRWIRFVES 
CDLIYYASSGSLRFERTNCLKSSDEF ' 
(SEQ ID NO: 65) 

CDS CDS_3 3 |261_bp 

atggaagttttcagaaaccataaaaagttaattttcacaaaaccggaaagttttgaaaaa 
attgcgccaaaattaaagtttttagggttactggcggcgcgtattgaaacagagtgtgag 
ggagcgtccgagatcggattgacacgaggtttacggcgttggattcgtttcgtcgagagc 
tgcgatttgatatattacgcgtcgtctggatccttacggtttgagagaacaaactgtttg 
aagtcgtcggatgagttttag 

(SEQ ID NO: 66) 

gene join (211441 . .211480,211596. .211931,213462. .213530, 

213537. .213542) 
/gene="Peptide_34 " 

/note="GenScan; Pl=Prom, P2 = 0.243, P3=0.206, P4 = PlyA" 
pept ide_34 | 134_aa 

MNTAIVLKIRGLDRCIIKGEYAAENEKYGAITIIRQHbTEDLRDQYLNIANPLDLRTELK 
SRYTIVSLPK^ITQMGLLFEWINLRFRDFRSVDEYNSALIKIVSKLKLCGEEHCLRQNER 
SHETSSRESDQKNV 
(SEQ ID NO: 67) 

CDS CDS „ 3 4 | 4 0 5_bp 

atgaacactgcaat tgtcctgaagataagaggacttgacagatgtatcatcaaaggcgag 
tatgcagctgaaaatgaaaaatatggggcaataacaattattcgccaacatctcactgag 
gatctcagagatcagtatctaaatattgcgaaccctctagaccttcggacagagttaaaa 
tccagatacacaatagtgtcattaccaaaatttataactcaaatgggtcttttatttgag 
tggataaatctcagatttcgggactttaggtccgtagatgaatataactcagctctaatc 
aaaatcgtttctaaattgaaactatgtggtgaagagcattgtctacgacaaaatgaaaga 
— agtcatgagacatcatccagagaaagtgaccagaagaatgtctga 
(SEQ ID NO: 68) 

gene 35 complement (join (213617 .. 213622 , 213659 . .213834, 

214 8 93 . .214 958 ,217045 . . 217 060 , 217158 .. 217197) ) 
/ gene = " Pep t i de_3 5 " 

/note="GenScan; Pl=Prom, P2=0.242, P3=0.100, P4=0.191, 
P5=PlyA" 

peptide_3 5 | 85_aa 

MIHQPVPKlJLrPKKFIVSFAIJNRVIPIJIJHLWTIWSYLVDELlLVILCNFAGFIJELHLD^WCT 
CSFDADHSTMSS CSGRMDDAQS VLG 
(SEQ ID NO:69) 

CDS CDS_3 5 |258_bp 

atgattcatcaaccagtgccgaagctcttacccaagaagttcatagtttcgttcgctctc 
aatcgagtgattccacttctgcatctttggacgatgcattcctatctggtcgatgagctt 
cttgtaatactttgtaatttcgccggctttctggaacttcatttggacatggtcaagacg 
tgctcctttgatgctgaccactcgacaatgtcttcatgttctggtcggatggatgatgct 
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caatcagtactcggatag 
(SEQ ID NO: 70) 

gene complement (j oin (2178 77 . .217882,218055. .218153, 

2182 72 . . 218 3 59, 218463 .. 218569, 218660 . .218749, 
218827. .218 970,21950 9. .219621,219923 . .220006, 
22 0 093 . .2 202 04 , 22 094 9 . .22 0 988) ) 
/gene= " Pept i de_3 6 " 

/note="GenScan; Pl=Prom, P2=0.921, P3=0.998, P4=0.782, 
P5=0.999, P6=0.998, P7=0.997, P8-0.996, P9=0.829, 
P10=PlyA" 
peptide_36 |278_aa 

MFARNIRLRKEYLYMKI^EGEERRLYEKKRKIREAIjQEGKPIPSELRNEEAKLREEIDLE 

DQNTSWPGSHIDDEYAKATEKDPKILMTTSRDPSAPLTRFVKVISEIIEMARSHDYSDV 

I FVTENRGRPDGL I VSHLPFGPTAYFQLLNVVTR YDI QSKKATGKM5.EQYPYVI FDNFTS 
QMGK^VGSMLKHMFPVPKLDARRIVTFRNKSDYISFRNHVYDKGEGGPKSIEIjK^ 
ELRLYQVKL.GTMEQEEAEMEWVLRPYMNSAKKRRFIGQ 
(SEQ ID NO:71) 

CDS CDS_3 6 | 8 3 7_bp 

atgtttgccaggaacattaggttaagaaaggagtatctctacatgaaaaacttggaaggt 

gaagagcgtcggctctatgagaagaagcggaagataagggaagccctgcaagaagggaag 

ccgattccatctgagctccgaaacgaggaggcgaagcttcgtgaggagattgatcttgaa 

gatcaaaacacctccgtagttccagggagtcatattgatgatgaatatgcaaaagcaacg 

gaaaaagatcccaagattttgatgactacgtctagggatccaagtgctcctctaacgcga 

ttcgttaaggtcatttctgagattattgagatggcgcgttcgcatgattatagtgatgtg 

atatttgttactgagaaccgtggtaggcctgatggccttatcgtttctcatctcccattc 

ggaccaactgcttactttcaattacttaatgtggtaacaaggtatgatatccaaagcaag 

aaagcaacggggaagatgtctgagcaatatccttacgtcatttttgacaactttacaagc 

cagatgggtaaaagagttgggagcatgttaaaacacatgttcccagttccaaaactggat 

gcaagacgtatagttactttccgtaataaatctgattatatttctttcaggaatcatgtg 

tatgataaaggagaaggaggcccaaaatcgatagagttaaaagaagtcggtcctcggttt 

gagttgcggctctaccaggtgaaattaggaacaatggaacaagaggaagcagagatggaa 

t999 ttctta 9accctacatgaactctgctaaaaaacgccgctttatcggccaataa W 
(SEQ ID NO: 72) - 

gene join(221006. .221045,222322. .222833,2 23 076. .223194, 

225250. .225287,225478. .225483) 
/gene= » Pept ide__3 7 " 

/note="GenScan; Pl=Prom, P2=0.575, P3=0.919, P4=0.363, 
P5=PlyA" 

peptide_3 7 | 222__aa 

MAAI TS LQAI HLKLGRRGS I RCGI S EPSGEPA PVGQKTRYNDGLAERVFMGLFARKMDKF 
GGSKKKKKDETKEKEFWEYDYESFVEVSKGVMQGRSRAQQQEVVREVLLSMLiPPGAPEQF 
RKLFPPTKWAAEFNAALTVPFFHWLVGPSQVI EVEVNGVKQRSGVRI KKCSTCLFTWDF 
EDMSCEMIYGQVPPTFEEDPATKQPCLADICSYTSKPNPPEP 
(SEQ ID NO: 73) 

CDS CDS_3 7 | 6 6 9_bp 

atggcggctattactagtctccaagcaatccatctcaaactcgggagacgtggcagcatc 

cgatgtgggatctcggagccgagcggagagccagctccggtagggcagaagactagatac 

aacgatggcttagctgagagagtgttcatggggctgttcgcgaggaagatggacaagttt 

99tggctcgaagaagaagaagaaggatgagacgaaggagaaagagttttgggaatacgac 

tacgagagcttcgtcgaggtttcaaagggagtgatgcaaggacgatcaagggcacagcag 

caagaggttgtgagagaggttcttctctccatgctccctcccggcgctcctgaacagttt 

agaaagctcttcccaccgacgaaatgggctgcggagttcaatgcagctcttacggtgcct 

ttctttcactggctggttggtccttctcaggtcatagaagtggaagtgaatggtgtgaaa 

cagagaagtggggttcgtataaagaaatgcagtacttgcctatttactgttgtagatttt 

gaagatatgagttgcgagatgatatacggacaagtgcctcctacatttgaagaagatcca 

gccacaaaacaaccttgtttagcagacatatgttcttacacatccaaaccaaatccacca 
gaaccttaa 
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(SEQ ID NO: 74) 

gene complement (join (226644 .. 22664 9 , 22 74 68 . .227721, 

22 772 9 . . 228119, 228766 . .2288 05) ) 
/gene= " Pept ide_3 8 " 

/note="GenScan; Pl=Prom, P2=0.923, P3=0.400, P4=PlyA" 
peptide_3 8 | 214_aa 

I^NVIAKKFTAMFNHRRLNLSRSSLFTASAASVISLiVFTIFIVSHVIiVRDFTEVVTIEI 

KTVVPYLPLRSEREQSNYTFTVKKDl^LHVLDVFGGRDVSGKFQQRVTEFLREDCEVlfFM 

MTWISPAVMFERFMRAITFQFETLSSSKSGVIQNLKGSDQIKILTNSETISLPVNDPRKI 

IHNTKPRDSRAVGNLYRNNVKEEMYASDFCCCQV 

(SEQ ID NO: 75) 

CDS CDS_3 8 | 6 4 5_bp 

atggataacgtcatcgccaagaaattcacggcgatgtecaatcaccggcggttaaacctc. 
"TTcacgatcatcactcttcacagcatctgcagcctcagtcatatctctcatcgttttcacg 
atcttcatcgtctctcatgtactggttagagacttcacagaggttgtgacgatagagatc 
aagacagttgttccttacttacctctgaggtcagagagagagcaaagtaactatacgttc 
acagtcaagaaggataacaatct tcacgttct cgacgtt tttggaggcagagacgtgtcg 
gggaagtttcagcagagagtaacagagtttctgagagaagattgcgaagtctacttcatg 
atgacgtggatct ctcccgcggttatgttcgagaggttcatgcgagccatcacatttcaa 
ttcgaaactctgtcgtcctcaaaaagcggagtaattcaaaatctgaaaggaagtgatcag 
atcaaaattct tacaaacagtgagactatttcct tacctgtgaacgacccgaggaagata 
atacataatacaaaacctcgagattcccgagccgtagggaatttgtatcggaacaacgta 
aaggaggagatgtacgcaagcgacttctgttgctgtcaagtatga 

(SEQ ID NO: 76) 

gene .j° in (229427 . .229466,229533. .229550,22 9588. .229880, 

232896. .232914,233006. .233230,234577. .234582) 
/gene= " Pept ide_3 9 " 

/note="GenScan; Pl^Prom, P2=0.588, P3=0.266, P4=0.420, 
P5=0.513, P6=PlyA" 
peptide_3 9 | 184__aa 

MVSRNKFAHNFTEKSRFSDYGLYRSVAKFLEKAFLEQREKGEEENASLLQICRGTTQTES 
ARRFYETLDVTSGFSVRFNFVGFIGFQVLETLIilDLRWALTCSRVCACMQTQHKLEMDKR 
DGVFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPG 
YPQH 

(SEQ ID NO: 77) 

CDS CDS_3 9 | 5 5 5_bp 

atggtcagcaggaacaagtttgcacataatttcacagaaaaatccagattttctgattat 
ggattatacaggagtgtggcaaaattcctggagaaagcgtttctggagcaaagggagaaa 
ggtgaagaagaaaatgcgagtttgttacagatttgcagaggaacaactcagacggaaagc 
gcaagacgcttctacgagactttggatgtcacatctggtttctctgtccggttcaacttt 
gttggatttatcggttttcaagtattggaaactctcttaattgattXgagatgggcacta 
acgtgttctagggtctgcgcgtgtatgcagactcaacacaagcttgaaatggacaaaaga 
gatggagtgtttggtcctcagccaatgggagttccaccggcacagcagatgtcccgtttt 
gaccaacctgcccctccagtcggctaccctcctgcgtct tacccaccggctcaaggctac 
cctcctgcaccttacccaccggctcaaggctaccctcctgcatcttatccgcctcctggt 
tatccccaacattga 
(SEQ ID NO:78) 

gene complement (join (235142 .. 235147 , 236154 236602 , 

24193 8. .242 031,242357. .242443,242643 . .242747, 
242822. .242 8 84,24 33.93. .243538,243 887. .244712, 
24 8 084. .2482 59,251494. .251614,253 62 9. .253 668) ) 
/gene = " Pept i de_4 0 11 

/note="GenScan; Pl=Prom, P2=0.369, P3=0.281, P4=0.229, 
. P5-0.841, P6=0.843, P7=0.996, P8=0.954, P9=0.712, 
P10 = 0 . 019, Pll=PlyA" 
peptide_4 0 | 68 8_aa 
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M FNAW I HRE YVDNEDD YCLL IHSPTIS F YRPEQYKLTTE PKNYVF S FGEKX.CFS VLMGNC 
VFDFDRKFGFSVLAEIAFFGFGGKLYFSILAGNYIFPVLRGFQPQDTDVIIASYPKSGTT 
WLKAiTVALLERSK^HSSDHPLLYHNPHGIIPFLEIDVYHESSSPNliAKFSAPPRLFSTH 
MPLHTIHEALKHSPCKIVYVCRNVKDTLISCWFYSCAIYKIEPTRRVLESMFNEFCDGTN 
YFGPFWDHLIiSYWRGSLEDPKHVLFMRYEEMKAEPRDQIKRLADFLGCPFTKQEEDSGSV 
DGILDLCSLRNLSSLEANKTGTINNVEHKFFFRKGEVGDSKNYLTSEMEKDRHDHQRKTS 
RFWFEFLEFLEFYYYVEQNICFTFGSGLLDFDPSEVRREIVFEFSSSSHDFDI IERGSFE 
SKMHDFYGDDGGDSETSISVSVIE^KEEYGLFVWPCSVILiAEYVWQERSRFRGSSVLEL 
. GAGTSLPGIiVAAKVGANVTLTDDSSKTEVMGLTWGVWDATI FDLRPNI I LG AD VL YDS SG 
LIRKLISTLFWFIGLPDFLEPEPVTSSWPDPPPNSTTSSHHDSHLFSAALLAGQILPVVR 
FSDLNRPESECCAVCIiYDFENDDEIRRLTNCRHIFHKECLDRWIMDYSQMTCPLCRTQFV 
PDELQTDFSQKLWSESSEDVSELLAQSS 
(SEQ ID NO:79) 

CDS CDS_4Q4~2 067_bp - ~~~ . 

atgttcaacgcttggattcatcgagagtatgtcgataacgaggacgattattgcttactg 

atacatagcccaactatcagtttttacaggcccgaacaatacaaattaacaactgagccc 

aaaaattacgtttttagttttggcgagaaattatgtttctcggtattgatgggaaattgc 

gtttttgattttgacaggaaattcggtttttcggttttggcagaaattgcatttttcggt 

tttggtggaaaattgtatttttcgatcttggcaggaaattacatttttccggttttgaga 

ggttttcaaccgcaagacactgatgtaatcattgcttcgtaccccaaatcaggcactact 

tggctcaaggccctcacagtcgctctgcttgagagatcaaagaaccactcttctgatcat 

cctctcctatatcataatcctcatggcattataccattcttggagatcgatgtgtaccac 

gaaagctcaagtcct aacctagccaagttctcagcacctccgaggctgttctcgactcac 

atgccactgcacacgatccacgaagcactcaagcactctccttgcaagattgtgtacgtg 

tgcaggaacgtgaaggacacgttgatctcgtgttggttttacagctgtgctatatataaa 

atcgaaccaaccagaagagttctcgagtctatgtttaacgagttctgcgatggaaccaac 

tattttggacctttttgggatcatctcttgagttactggagaggaagcttggaagaccca 

aagcatgtccttttcatgaggtatgaggagatgaaagccgagcct cgtgatcagatcaag 

agacttgcggacttcttgggatgtccttttactaagcaagaagaagatagtggatctgtg 

gacgggatcttggacctctgctctctgcgtaatctgagcagtttggaggctaacaaaaca 

gggacaataaacaatgtggagcacaagttttttttccgtaaaggagaagtcggtgactcg 

aaaaattatcttacgtctgaaatggagaaagatagacatgatcatcaaagaaaaact tea 

aggttctggtttgagtttttagagtttttagagttttattattatgtcgaacagaacatt 

tgttttacatttggctctggcttgctagattttgacccatcagaagtgagaagagagata 

gtgttcgaattcagcagcagttcccacgacttcgacattattgagcgtgggagctttgaa 

tctaagatgcatgatttctatggcgatgatggaggagattccgaaacttccatctctgta 

tctgttatcgagaatatgaaggaagagtatggtttgttcgtttggccttgtagcgtcatc 

ctcgccgagtacgtctggcaagagcgatctcgatttcgtggctcttcagttctcgagcta 

ggagctggcacttctttacccggtttagtagctgctaaggttggagctaatgtcaccctt 

accgatgactcaagcaaaacagaggtaatgggtctcacttggggagtgtgggatgcaacc 

atatttgatctgcggcctaacattatacttggagcagatgttctatatgattcaagtggt 

ttgatacgaaaactaatctccacacttttctggttcattggtttacccgatttcttggaa 

cccgaaccggttacatcttcatggcccgacccaccaccaaactccaccacctcaagccac 

catgactctcacttgttttcagcagcgctgctagctggacagatcttgcccgttgtcaga 

ttctcggatctaaaccgacccgaatccgaatgttgtgcagtgtgtctctacgacttcgag 

aacgacgatgagatccgacggctgacgaattgcaggcatattttccataaagaatgcttg 

gaccgttggattatggattatagtcagatgacgtgtccgctttgtcgtacccagtttgta 

cctgatgagcttcaaacggactttagtcaaaagctttggtcggaatctagtgaagacgtt 

tctgaacttcttgctcaatcatcttag 

(SEQ ID NO:80) 

gene join (254 094 . .2 54133,2543 99. .2 5723 6,257535. .257678, 

257835. .258 083,258167. .258 907,259556. .259561) 
/gene=" Pept ide_4 1 " 

/note="GenScan; Pl=Prom, P2=0.350, P3=0.377, P4=0.685, 
P5=0.683, P6=PlyA" 
peptide_41 | 1323_aa 

MSKQIDELRSSQNQQTEELGSKINALEALIEKYFANAPPPQRDGKQTDASSDITDGTPQA 
KAPPDRSNPENSSFKPHDNNNPPIHHSLSARLTKIGFPMFDGSELREWTYGCEQFFSIDS 
TPPELKVRLASLHMTGKALQVmHSYLANRYNIFSLWPEYVAAISDRFSELYDDPLAELVS 
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LKQGNDTIDVYIiDKFDCAMTRITLAPDHALSIFLTNMNQHIoALHVT^QFKVSTVPEAAKIA 
KIjHELSIjSHMPTKTSRPPFNSSQRSNYSQPNKSQNHNSTSPTTTANPNNKPLIANAPQKW 
LSFDEMQERKRKGLCMFCEEPFTPGHHLKHKRAEFLFLDLDAETEFDDEIAliVEQIRETT 
I SDDDDKVPT I S VHALNGAPTFNCMRLVRKYEKRKL.H I L> I DPGSTHNFLD I QMAKGLGCS 
LTPIKPMSVVAASGDLVTKYKCSSFAWKMQGYGFTAEIRTLPLGCSDLVLGVQWLSTLGP 
ILWDFLNLRMEFKFNELKHVLRGISPNSSKLISGSSFNKliMLQDPQLALLHLREIDETTE 
QEPLiEPETIFCHIEASETENDNSGSLERLbDSYTDVFDEPSTLPPYRAGFNHKIPLEAGS 
NPVKTLRPYRYSSIQKDSIDKMIQDMLSQGI IQYSASPYASPIVTjVKKKDGSWRLCVDYRG 
LNKQTIKDKYPIPL.LEDLLDELGGSKYFSKLDLRAGFHQLCMSPEDVHKTAFKTHSGHYE 
YLVMPFGLTNAPCTFQGLIWHVFAPVLRKFLLVFFDDIIilYSKTWEEHLDHLDKVLAILR 
HQQIiYLKKSKCTFGGTRIEYLGHFISHDGVSTDPTKIKAVEEWPQPKHQKHLRSFLGLAN 
YYRRF I QGYS 1 1 ARPLTI MLRKDGF AWNTEASDAFHLLKQALI SAPVXrALPDFSKTF I VE 
TDASISTTGIGAILMQDNHPVCYISRALGPRHQGLSVYEKELLAVVHAIITDLQNKPGSHAA 
YSFVNGELRRRGKIjVVGNDPAIKLHIFKWIJ^DSAPIjPVPTGVWESVSIjDFIEGLPPSSGK 

hcilvvidrlsknahflalshpytai^ 

ktletylrcmtsdspstwsawlplaewwynttyhtairsspfb-iiygqpppvhlpylpge 

STSTTVDRSLQRREELIDMMKFHLLRAQNRMKQYADSHRSERAFHIGDYVYXjKLQPYRQH 
SLKGRHLPHKLSPRFYGPYEIQDRVGNLiAYKLRLPFEAAIHNVFHVSQLKLGPNPPATPS 
SLPQYLKIDVGTAKEPEKILETKMVNRRNRAV^ 
FNT 

(SEQ ID NO: 81) 

CDS CDS_41 | 3 972_bp 

atgagcaagcaaatagacgagctacgt tcttcgcaaaaccagcaaactgaagaactcggt 

agcaaaatcaacgcactcgaagcactcatcgaaaagtacttcgctaacgctccacccccg 

caacgcgacggtaaacaaacagacgcaagttctgatattacggatggaacaccgcaagct 

aaggctccaccagaccgttccaatccagagaacagttcctttaaacctcacgacaacaat 

aacccacccatccatcatagcctatccgcaaggctaacaaagattggctttccaatgttt 

gacggctccgaactacgagaatggacctacggctgtgaacagttcttctccatcgacagc 

accccaccggaattgaaggttcgtcttgcatctcttcatatgacgggaaaagcactacaa 

tggcatcactcttaccttgccaatcgatacaacatct tt teat tat ggccagaatatgtt 

gctgcgatctccgatcgtttcagtgagctttacgacgatccattagcagagttggtaagc 

ttgaaacaaggaaacgataccatcgatgtgtatctagataaatttgattgcgccatgacc 

agaatcacgcttgcgccggatcacgcattgagtatattcttgacaaacatgaatcaacat 

ctagctcttcacgtgcgccaattcaaggtcagtacagtacctgaagccgcaaagatagcc 

aaactacacgagctctccctctcacatatgccaacaaagacatcgcgccccccattcaac 

tcttct caacgatcaaactactcccaacccaataaaagccaaaaccacaactccacctcc 

cctactactaccgccaat ccaaacaacaaacccc teat tgcgaat get cctcaaaaatgg 

ctttcttttgacgagatgcaggagcgcaaacgtaaaggattatgcatgttctgtgaagaa 

ccgtttacaccaggtcaccaccttaagcataagcgcgctgaatttttgttcttggactta 

gacgctgagacagaatttgacgacgagattgcactagtggagcaaatccgtgagacaacc 

ataagcgacgatgatgacaaagttccaactatctccgtccacgccctcaacggtgcgcca 

actttcaactgtatgcgcctcgtcaggaaatacgagaaacgtaaactacatatactgatt 

gatccagggagcacacacaacttcttggacatccagatggctaagggattaggttgttct 

ttgacaccaatcaaaccaatgtcagttgttgcagcaagtggcgacttggttactaagtac 

aagtgcagctcttttgcttggaagatgcaaggctacgggttcacagctgagattcgaacc 

ttaccactaggatgcagtgatctcgtcctgggggttcaatggctttccaccttaggacca 

atcctatgggatttcctaaaccttcgtatggagttcaaattcaacgagctaaaacatgtc 

ttacgtggaatatcacccaacagctcaaaattaatttctggaagcagctttaacaaactg 

atgttgcaagatccacagcttgctctactccatcttcgcgagattgacgaaactacagag 

caagaacccct tgaaccggaaacaattttctgt catattgaagcgagtgaaacagagaac 

gacaattccggttcactcgagagactccttgattcatacacggacgtatttgatgagccg 

tcgaccttacctccttatcgcgcgggtttcaatcacaaaataccattggaagcaggatca 

aatcctgtgaatctccgaccatat'cgatactcttcgatacaaaaggattcaatagacaag 

atgattcaagacatgctttctcaaggtatcattcagtacagtgcaagtccctacgcttca 

cctattgtacttgtgaaaaagaaagacgggtcttggcggctttgtgtcgactacagaggt 

ctcaataagcaaacgatcaaagacaaataccccatcccactactcgaagatctccttgat 

gaattgggcggctcaaagtacttctctaaactggatttacgtgcaggtttccaccagctt 

tgtatgtctccagaagatgtgcacaagacagcttttaaaacacattcaggccactacgaa 

tatttggtaatgccattcggcctcacaaacgcaccttgcacgtttcaggggcttatgaat 

cacgtattcgcacccgttctacgaaagtttctccttgttttcttcgacgatatattaatc 
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tacagcaagacctgggaggagcacctggatcacttggacaaggttcttgctatacttcgc 
catcaacaactctatctcaagaaatcaaagtgtactt ttggaggaacgagaatcgaatac 
cttggtcactttatttcccatgatggcgttagtactgacccaaccaagataaaggcagtc 
gaggaatggccacaaccgaagcaccagaaacacctccgcagcttcttaggtctagccaat 
tactatcgaagattcatacaagggtacagcattatcgctcgaccccttaccatcatgctt 
cgcaaagacggttttgcttggaatacagaggcttcagacgcgttccatctcctcaaacaa 
gcattaatctcggccccggtccttgcactccccgatttctccaagactttcatcgtcgaa 
accgacgcttccaacactggcataggcgcaattcttatgcaggacaatcacccagtgtgc 
tacataagtcgtgcattagggcctcgacaccaaggcctttctgtttacgagaaggaactc 
cttgctgtggtccacgcaatcattaccgacctgcaaaacaaacctgggtctcacgccgcg 
tactcct ttgtcaacggagaacttcgccgccggggtaaacttgtggttggtaacgaccct 
gctatcaaacttcatatcttcaagtggcttcatgattctgctccgttaccggttcctacg 
ggtgtatgggagtctgtgagccttgactttatcgagggtctaccaccatcatcaggaaaa 

cattgcattctcgtcgttattgai:.cgtttaagcaagaatgctcactttctcgccttgtct_ 

— catccgtacactgcaatggacgtggcaaagctatatatggatcaggtctttcgtcttcat 

gggatgcctaaagacattacaagcgatcgtgacccaacgtttctcagcgaggttactaac 
aaaactttggaaacctatctccgatgcatgacttctgattcaccttctacatggagcgcg 
tggttacccttagcagaatggtggtataatactacttaccacactgcgattcgcagttca 
ccgttcgagatcatctatggccagccacctccagtgcacttgccttaccttcccggcgaa 
agtacttctaccacggttgacaggtccttacaacgaagggaagagctcattgacatgatg 
aagttccacctcttgcgagctcagaacaggatgaaacaatacgcagactcgcaccgttct 
gaacgtgcgtttcatattggtgactatgtctatttgaagctccagccgtatcgacaacat 
tccctcaaaggacgacatttgcctcacaagctttcaccacgtttttatggtccgtacgag 
atacaagatcgtgttggtaacttggcatataagctacgtcttccatttgaagcagctatc 
cat aatgtcttccatgttagt cage taaagctcggccccaacccgcctgcaactcct teg 
tcgcttccccaatacctcaaagatgtcggcactgcaaaggaaccagaaaagattcttgaa 
acaaaaatggtgaatcgccgaaacagagctgtcacgaaggtactagtacagtggaaaggc 

tactctccggaacaagccacatgggagttctatcaagacttcgtcgcgaaacatcctgat 
ttcaatacttga 

(SEQ ID NO: 82) 

gene join(259921. .259960,260542. .260668,260773. .260900, 

261168. .261305,261389. .261454,261590. .261702, 
261789. .261899,262016.-262053,262136. .262244, 
2 62377. .262407,263398. .263535,263662. .263667) 
/gene="Peptide_42 " 

/note="GenScan; Pl=Prom, P2=0.999, P3=0.938, P4=0.995, 
P5=0.946, P6=0.803, P7=0.899, P8^0.700, P9=0.950, 
P10=0.623, Pll=0.924, P12=PlyA" 
peptide_42 | 33 2_aa 

MIHWGGVTCCLSAAALYLLGRSSGRDAEVLKTVTRVNQLKELAQLLELDSSKLLPFIVAV 
SGRVGSDTPIKCEHSGIRGVIVEETDDGTSRVNWGARGATGFAIiTVGSEVFEESGRSLV 
RGTLDYLQGLKMLGVKRIERVLPTGMPLTIVGEAVKDDIGDLRIQKPERGPFYVSPKSLD 
QL I SNLGKWS RL YKYASMGLTVFGVFL I TKHV I DFbLERRQRRE I QKRVLDAAAKRAGTR - 
-OSNGAHESVSDSTKNEGAVPDLCVICrjEQNYNAVFVPCPLCRRRIDQVIKLDIGHIDPSI 
T S KGGRLELRTS ETNL VD PDS ETE I I S MTDRS 
(SEQ ID NO: 83) 

CDS CDS_42 | 999_bp 

atgattcattggggtggagtaacctgctgcctcagcgccgccgctctttatcttctcggc 
cggagtagtggcagggacgctgaagtactcaaaaccgtcactagggttaaccaactcaag 
gagctagcgcaattgctagaattagatagcagcaagctccttcctttcatcgtagccgtt 
tcaggaagagttggctctgacactcctatcaagtgcgagcatagtggcatacgcggcgfet 
atcgtcgaggaaacggacgatgggacaagtcgtgtgaatgtagtgggagctcgtggtgca 
acaggttttgctttgaccgtcggaagtgaagtttttgaagagtcagggcggtctcttgta 
cggggaacacttgattatctccaaggccttaagatgcttggagttaagcgcattgagcgt 
gttcttcctactggaatgcctctcacaattgttggtgaggctgtcaaggacgatattggg 
gacctaaggattcagaaacctgaaagagggcctttctacgtctctcctaaatcactcgat 
cagctcatttctaatctggggaaatggtcaaggttgtacaagtatgcctccatgggttta 
actgttttcggtgtgtttctaattacaaagcatgtcattgattttcttctagagagaaga 
cagcggcgagaaatacagaaaagagtgcttgatgcagcagctaagagagctgggactgaa 
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ggt tcaaacggcgcacatgagagcgtatcagattctaccaagaatgaaggcgctgttcct 
gatctctgtgtgatctgccttgagcagaactacaatgctgtgtttgtcccctgtccactt 
tgtcggagacgtatagatcaggttatcaagttggatatcggccatatcgacccatcgatc 
acatctaagggcggtcggttagaacttagaacctcggaaaccaatctggtcgacccagat 
tctgagacagagataatcagtatgacagatagatcctaa 
(SEQ ID NO: 84) 

gene complement ( j oin (263813 . .263818,263901. .264080, 

2 64138 . . 2 684 81 , 26 8 603 . . 2 68619, 268932 . .270282, 
2703 83. .270724,270866. .271945) ) 
/gene= M Pept i de_4 3 " 

/note="GenScan; Pl=0.999, P2=0.973, P3=0.024, P4=0.030, 

P5=0.016, P6=0.564, P7=PlyA" 

peptide_4 3 | 24 3 7_aa 

MIAKIAKDVLQKLNATPCRDFEGIVGLDAHLKEIESLLDLDYDGVKMVAITGPAGIGKTT 
IARALHSLLSRRFQLTCFVDNL,REICPSGLDEYGWKKRLQENFLSKVIjNHKDMRICHLGA 
VKENLCDQKVXi 1 1 LDDVNS LKQLEALANES AWFG PGSRI WTTENL E I LQQHG I GSTYHV 
GFPFDEEAIEILCRCAFRLRSPLYGFEVIjCTRI IRLCGNIiPLALCVVGKSLRGKKKDEWE 
DVVI^IjETILDQGIEDVIjKVGYESLEENEQTLFLHIAVFFNNEDSDLVKAMFSDGTIDVK 
RGLQILVYRSLIEISTYGNIIEMHKLLRRVGRQAIHKQEPWKRQILLDAHEICDVLENET 
GTRAVSGISFDISGINDLIVDERAFKRMYNLQFLKVYKSRDDKIjKFDILAGEKSRDDMND 
VIHIPEEMKFPRRLRFLHWNAYPNKCLPPSFHPEYLVELDMKHGKFEHLWKGTQRLANLK 
KLDIjSFSCQLKELPDLSRATNLKILDMSYCKSIjVEIPSSFMHLQKLELWWMDDCVNIjQVI 
PDHLNI^SIiGIVEMNGCSKLRKFLFISTTSPRLSICGDSFRQRPPSSGMYSRIRKSGKLK 
GLTHLHKSLICLDLRYSDIETIPDCIKDLHQLEQLSLDGCRRLASLPELPGSIKSLNAED 
CESLETVFCPLNSSPNAVLDFTNCFKLGQQARREIIHRSLSCEWVILPGKKVPAKFDHRA 
RGSSLTIRIPYGNNPLSAVSILKLCWVSPNHQISKKSRHSEYLVCRCISKGDLDPWEE 
FCVSYVSRYRSEHLLIFHPRLPFIVPSEVSRETVFEFSSKSHEFDVIECGAKVFEDKSIQ 
GSYESGSDQVSEDHIDYIiTDGRYESCEDDINSLIDESLELiNPANPLQATPYESSEEREWS 
EDEYELSEQYRQSRRQFRAQCRRRRSSIFEKSPFLGTETMVETRLQERSLTEQVDELRSIi 
HDLLAA ELKS RSDSLDARFDRLEALMFS ANAS PLHATGKAPLDPGPSHPPTPFNIiGSSQP 
PDPPDLNGFRQHNEFGRLTSRLSKISFPKFDGTDLRDWLSKCEQFFDIDGTSQELKVRLA 
AMHLTGRATQWHTNYMSTRFGMFPSWTDYIIAISARFCEIiFDDPLAELVALKQGSDLVT^A 
YLDKFETTRMRLVIjPEAHALSIFLANMNPHLSLHTRQFETTSIAGAAKIAMLHESSLSHT 
PNRQRAPFNPYPNTKPYQKPNNPSPLLPLTQAPTNQKPSFIPRNPTDKPPRKFSYQEMQD 
RRSKGLCMFCDEPFTPGHQLKHKRSQIYVMECDDADTIPDDSSSDAEPESDDKQATVEVT 
PVISINALNGSTSYNCMRLIGHHGKHKLHILVDPGSTHNFVDLNIATQLGCELEPTRPMS 
VKAATGDTLiLTNFKCS AFTWTVQGS S FTTE I RTVPLDCCDFVLGVQWLCTLGP I LWDFLN 
LRMEFTLSGTKHVLRGVVKTGGKVI KGSSLNKXjMLQEPQIALIQLQQIDENADNQQSLNP 
EMLYSHI SASGTTNADDPALQQLLQTFEDI FQEPKSLPPFREGFDHQI PLLAGSNPVNLR 
PYRYSSLQKDTIDTMIKEMLTQGI IQHSASPYASPIVIjVKKKDGTWRLCVDYRGLNKQTI 
KDKYLIPLLEDLLDELGGAKYFSKLDLRAGFHQLRMSEDDVYKTAFKTHQGHYEYLVMPF 
GLTNAPCTFQSLMNHVFQDLSRKSVLVFFDDILVYSKTWEEHLQHLAEVFLILQQQQLYL 
KiSKCTIGATIIEYLGHFISADGVSTDPRKIAVIRDWPIPTTQKHLRSFLGLANYYRRFI 
KGYSSIARPLSTLIiKXTXSFTWSLEADQSFSDLKAALSSTPVLALPDFDQPFIVETDASNT 
GIGAVLMQGKHPICFISRSLGPRHQNLSWEKELMALVHAVQTWHPYIAHRPFIINTDQR 

. SLKYLMEQKITTPFQHMWLSKXiMGYNFEIHYKQGKDNVVADALSRVSGSQLLNMVLSQAH 
TGF YDS LKLLWETDATLQKI I CDLKSNS SSHPLFTYTNGELRRRGKL WGNDKD I KLH I F 
KWLHDSAIGGHSGRDATLHRIRSLFYWPKLNVEVQNYIRNCSICQQNKYDMAAKPGLLQP 
LPI PDGIWESISLDFIEGLPPSQGKHCIMVVVDRMSKNAHFIPLSHPYTALTVAQAFLDN 
IFKLHGMPKDVISDRDPIFISEVWNELFRVQGVTLKRSTAYHPQTDGQTEVTNKTLETYL 
RCMAAETPTSWSKWLSLAEWWYNTTFHSAIQATPYEVIYGQPPPLHLPYLPGESSSVVVD 
RSLQKREEVINMLKFHLIiRAQNRMRQYADAKRSQREFKIGDYVYLKLQPYRQHTVKKNK^ 
PHKLSPRFYGPFRVIjDRIGKVAYKLSLPTEAAIHDTFHYWLDLGNTKEPAAIIjETKTVkR 
QNEAATKVIjVHWKDES PEMATWEF YKDFTNKYPLFNP 
(SEQ ID NO: 85) " 

CDS CDS_43 | 73l4_bp 

atgattgcgaagattgcaaaagatgttttacaaaaactgaatgcaacaccgtgtagagat 
tttgaaggcattgtgggacttgatgctcatctaaaggaaatagagtctttgttagatctg 
gattacgatggagttaagatggttgccatcactggtcctgcagggatcggtaagactacc 
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atagctcgagctttacatagtctactctcccgtaggttccagcttacttgttttgtggac 

aaccttagggaaatctgtcctagtggtcttgacgaatatggttggaagaaaaggttacaa 

gaaaattttctttcaaaggttttgaaccacaaggacatgaggatatgccatttaggtgca 

gtaaaagaaaacctatgcgaccagaaagtgcttatcattcttgatgacgtgaacagtcta 

aagcaattagaggcgttggctaatgagagtgcatggtttggtcctgggagtaggattgta 

gtgaccacagaaaacctagagattttgcagcaacatggtatcggtagcacttaccatgtg 

gggtttccatttgatgaagaagct'atcgagatcttatgtagatgtgcttttagactaagg 

tctccactttatggttttgaggtgctttgtacaaggataataaggctttgtggtaatctt 

ccattggctctatgtgtggtgggtaaatctttacgtgggaagaagaaggacgaatgggaa 

gatgtagtgaacagactggaaactattcttgatcaaggtatagaggacgtgctaaaagtg 

ggctatgaaagtttagaggagaatgagcaaaccctatttctccacattgcagtcttcttc 

aacaatgaagatagtgatcttgtgaaagccatgttctctgacggtacaatagatgtcaaa 

cgcggtttgcagatcctagtctatagatctctcatagagatatctacatatgggaatata 

atcgagatgcacaagctactacgacgagtgggtagacaagccattcataaacaagagcct 

tggaaacgccaaatcttactggatgcgcatgagatttgcgatgtcctcgaaaacgaaaca 

ggaactagagctgtgtccggcatatcatttgatatatcaggaatcaacgacctgattgtt 

gacgaaagagcttttaaaagaatgtataatcttcagtttctcaaagtctacaaaagcaga 

gatgacaaactaaaatttgacattcttgctggggaaaaaagcaga'gatgatatgaatgat 

gtaatacatatacctgaggagatgaagtttccacgccgtctaaggtttctacattggaat 

gcataccctaacaagtgtcttcctccttcctttcatcctgaatatctcgtggaacttgat 

atgaagcatggcaaattcgagcacctctggaaaggaacccagcggcttgcaaatctcaag 

aagcttgatctgtcattttcatgccagttgaaggaactcccggatctttcaagagcaaca 

aatctgaagatattggatatgagttattgcaagagtttggtagagattccatcctctttt 

atgcatcttcagaaactagaattgtggtggatggatgattgcgtaaacctacaagtcatt 

ccagaccacctgaacttggcttctcttggaattgtcgaaatgaacggatgttcaaaattg 

agaaagtttctgtttatttcaactacttcaccgagattgagtatatgtggggactctttt 

cgacaacggccgccatcatctggtatgtactctcgtataaggaaaagtgggaaactcaag 

ggattaacacatctccacaagagtctaatatgtctagacctacggtattctgatattgag 

acgattccagattgcatcaaagatcttcatcagctagagcaactctcccttgatggatgt 

agaagactcgcatcattgccagagctccctggttcgatcaaatccttaaatgcggaagat 

tgcgaatcactggagaccgtgttttgccctttgaatagcagtccaaatgcagtactcgat 

ttcaccaactgcttcaaattaggccaacaagcacgaagagaaattatccaccgttccttg 

tcttgtgaatgggttatcttaccaggaaaaaaagtacctgccaagtttgatcaccgagcc 

agaggaagttcattgaccattagaattccctatggtaacaatcctctttctgctgtatct 

atattgaagctttgcgttgtggtgtcacctaaccatcaaatcagcaaaaaaagtagacat 

tcagaatacttagtgtgccgctgcataagcaaaggcgacctagaccccgttgttgaggag 

ttttgcgtaagctatgtctccagatatcgatctgaacatctacttatatttcaccctcgc 

ttgccattcatcgtcccctccgaagtcagcagagagacagtgttcgaattcagcagcaaa 

tctcacgaattcgacgttattgaatgtggtgccaaagtgtttgaagacaagagcatccaa 

gggagctatgaatctggatcagaccaagtgtctgaagaccacattgattacctcactgat 

gggaggtacgaatcatgtgaagacgatattaatagtctgatcgacgagagccttgagttg 

aacccagcgaatcctttacaggcgacaccgtatgaatcgagtgaggagcgcgaatggagt 

gaggacgaatacgaattgagtgaacagtaccgccaaagcagacgacaatttcgagctcag 

tgccgccgaagaagatctagcatcttcgagaagtcacctttcctcggtaccgaaaccatg 

gttgaaacacgactccaagagagatctctgactgagcaagttgacgagttacggtcctta 

cacgacctcctcgctgctgagcttaagtctcgttccgactccctcgacgctcgtttcgac 

agactcgaggccctgatgttctccgccaacgcctcaccgctccacgccaccggaaaagcc 

ccactagaccccggcccctcccaccctccgacacccttcaacctcggttcttcccaacca 

cctgaccccccagatctcaacggattccgtcaacacaacgagtttggcagactcacatct 

cgcctttccaaaatctcgttcccaaagttcgatggcactgacctccgcgactggctttcc 

aagtgtgagcagttctttgatatcgacggcacctcacaggagctgaaggtacgcttagct 

gcgatgcatctcaccggaagagctactcaatggcacacaaactacatgagcactaggttt 

ggcatgttcccctcttggacagactatataatcgccatatctgctcgtttttgtgaactc 

tttgacgaccctcttgctgagctagtggccctcaagcaaggctccgatttggttgctgcc 

tatctcgacaagtttgagacgacaaggatgcgccttgtccttcctgaggctcatgctctt 

agcattttcctggctaacatgaaccctcatctctctctccacacgagacagtttgaaaca 

acctccattgcgggcgctgcaaagattgctatgctccatgaatcctctctctcccatacc 

ccaaaccgacaaagagcccccttcaatccttacccaaacacaaaaccctaccaaaaacca 

aacaacccttcccccctcctacctctaacccaagcccccaccaaccaaaaaccctccttt 

attccccgaaaccccactgacaaaccacctcgaaagttctcctaccaggaaatgcaagac 

agaagatcgaaaggcttatgtatgttttgtgatgaaccatttactcctggccatcaacta 
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aaacacaagcgctctcagatctatgtcatggagtgtgatgatgctgataccatccctgac 
gatagctcctctgatgccgagccagagtcagacgacaagcaagcgactgttgaagtcaca 
ccagtcatctccatcaatgccttaaacggctccacctcctacaactgtatgcgcttgatt 
ggtcatcacggcaaacaeaagctgcatatacttgtagaccctggaagtacccacaacttt 
gtggacctcaacatcgccactcaactgggttgtgagctcgaaccaacgcgcccaatgtca 
gtgaaagctgcaactggtgacacccttctcacaaacttcaagtgctcggctttcacttgg 
accgttcaaggctcctcctttacgacggaaatccgcactgtccctcttgactgctgcgat 
tttgtgcttggagtgcaatggctttgcactttgggacctattctttgggacttcctcaac 
ctcagaatggagttcacactctctggcaccaaacatgtacttcgtggcgtcgttaagaca 
ggaggcaaggttattaagggctccagcttgaacaagctcatgctccaagaacctcaaata 
gccttaatccagctacaacagattgatgagaatgccgacaaccagcaaagcctcaatcct 
gaaatgttgtactctcacatctctgcttcgggaacaacaaatgctgacgacccagctctc 
caacaacttcttcaaacgtttgaagatatctttcaagaacctaagtcactgccaccgttt 
agagagggttttgatcaccagataccactccttgcgggctccaaccccgtcaacttacgc 
ccgtatcggtactcttctcttcaaaaagatacaatcgatacgatgatcaaggaaatgctg 
actcaagggatcattcagcacagtgctagcccctacgcctcacccatcgtcttggtcaag 
aaaaaagatggcacgtggcgtctttgtgtggattacagagggctcaacaagcagacgatc 
aaggacaagtatctgatccctctgctagaagacctcctcgatgaacttggtggcgccaag 
tacttctctaagttggatcttcgagcaggttttcaccaactccgaatgtcggaggatgac 
gtctacaagacagcattcaagacacatcaagggcactatgaataccttgtcatgcctttc 
ggcttgacaaacgccccttgcactttccagagtcttatgaatcatgtatttcaagacctc 
tctcggaaatcagtgcttgtctttttcgacgacatccttgtatacagcaagacttgggag 
gaacacctacaacaccttgcggaggtcttcctgatcctgcaacaacaacagctgtatctc 
aaactctctaaatgtactattggtgcaacgatcatagagtaccttggccactttatttct 
gctgacggcgtgagcaccgatccccgaaagattgctgtaatacgtgattggccaatccct 
acgacacagaagcacctacggagcttcttgggattggcaaactattaccgccgcttcatc 
aaaggttatagctccatcgctcgccccctcagcacgctactgaaaaaagatggctttact 
tggtcccttgaagctgatcaatccttctcagacctcaaagcagcgctcagttccacgcct 
gtcttagccctccccgactttgatcaacctttcattgttgagacagacgcgtcaaacacc 
ggtattggtgcggtccttatgcagggaaaacaccctatctgcttcataagtcgctccctt 
ggtccccgccaccaaaacctgtctgtgtatgaaaaggagttgatggcattagtgcacgca 
gtgcaaacctggcacccctaccttgcgcatcgccctt teat cat caacactgatcagcga 
agcctcaagtacctcatggagcagaaaataactacacctttccagcacatgtggctctct 
aagcttatgggctacaacttcgagatccactacaagcaagggaaagacaatgttgttgct 
gacgctctctctcgagtctctggctctcagttactgaacatggtactatctcaggctcat 
actggtttctacgattcgttgaagc tact ctgggaaaccgatgct act ctccagaagatt 
atctgcgacttgaaatctaacagctcttcacaccccctcttcacatacacaaacggagag 
ctgagacgtcgaggcaagctagtcgttggaaacgataaagatatcaagttgcacatcttc 
aaatggcttcacgactctgccattgggggacactctggacgtgacgcaaccttacat cgt 
atcagatctttgttctattggcctaagctaaacgtggaggtccagaactacattcgaaac 
tgtagtatctgccaacagaacaagtacgatatggctgccaaaccgggtcttctacaacct 
ctaccaattcctgatgggatctgggaatctataagcctagacttcattgaaggtctcccg 
ccgtctcagggcaaacactgtattatggtagtggtcgataggatgagcaagaacgcacac 
tttatacccttgtcccacccatacacggccctcactgttgctcaggcattcttggacaac 
atcttcaaactccatggcatgcccaaagatgtcatcagtgacagagatccgatctttatc 
agtgaggtttggaacgagctcttccgtgtacaaggggttactctcaagcgctccactgct 
taccaccctcaaaccgacggtcaaacagaggtaaccaacaagaccttggagacctatcta 
cgctgtatggctgctgagacaccaacatcctggagcaaatggctcagtcttgcagaatgg 
tggtataacaccacttttcactctgccattcaagctactccatacgaggtcatctacggt 
caacccccaccacttcacctcccttaccttccgggtgaaagctcctccgt tgttgtagac 
cgcagcctccagaagcgtgaggaagtgattaacatgcttaaatttcacttgcttcgcgct 
cagaaccgtatgcggcagtatgccgatgcaaagcgttcccagcgagagttcaagatcggt 
gattacgtatacttgaagctccagccatatcgccagcatacagtgaaaaagaataaggct 
ccacacaagctctcgccacgcttttatggtcccttccgtgtgctagaccgtatcggcaag 
gttgcctacaaactctcgctgcctactgaagctgcaattcatgacacatttcactactgg 
cttgacctcggcaacaccaaagaacctgcagcaattctggagactaagactgttaagcgc 
caaaacgaagctgcaacaaaagttttagtccactggaaagatgaatcacctgaaatggca 
acttgggagttctacaaggacttcaccaacaagtatcctcttttcaatccttga 
(SEQ ID NO:86) 

BASE COUNT 89612 a 47329 c 46973 g 88076 t 
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ORIGIN 

Rf Gene Region 

1 tggttttcat cccactcttg ttaccttcaa cacccttctc cacggattat gtgtggaaga 
61 tagggtttct gaagccttgg atttgtttca tcaaatgtgt aaaccaaatg tcgtaacctt 
121 caccacgctg atgaacggtc tttgccgcga gggtagagtt gtcgaggccg tagctctgct 
181 tgatcggatg gtagaagatg gtctccagcc taaccagatt acttacggaa caattgtaga 
241 tgggatgtgt aagatgggag acactgtgtc tgcattgaat cttctgagga agatggagga 
301 gttgagccac atcaaaccgg atgtggtaat ctatagtgcc atcattgatg gcctttggaa 
361 agacggacgt cataccgatg ctcaaaatct tttcattgaa atgcaagaca agggaatctt 
421 tccagatata gttacctaca gctgtatgat taatggattt tgtagctctg gtaaatggag 
481 tgaagcccag cgcttgttgc aagaaatgtt agtaaggaag atcagccctg atgttgtaac 
541 tttcagtgga ttgatcaatg cattggtcaa agagggcgat cttaattctg ctcaagacct 
601 tttacaggag atgatttcta gtggtgtgtg ccctaatgtc gttacttgta acactttgct 
661 ggacggtctc tgcgatcgcg ggaaactaaa agatgcattg gaaatgttta aggctatgca 
721 gaagagtatg atggacattg atgctactca tgccttcaat ggtgtggaac ctgatgttca 
781 aacttacaat atattgatca gcggattgat taatgaaggg aagtttttag aggccgagga 
841 attatacgag gagatgcccc acaggggtat agtcccagat actgttacct atagctcaat 
901 gatcaatgga ttatgcaagc agagtcgcct agatgaggct acacaaatgt ttgattcgat 
961 gggtagcaag agcttctctc caaacatagt gacatttaac acactcatta ctggctactg 
1021 taaggcagga atggttgatg acgggctgga gcttttctgc gagatgggtc gaagagggat 
1081 agttgctaat gcaattactt acatcacttt gattcgtggt tttcgtaaag tgggtaatat 
1141 taatgggtct ctagacattt tccaggagat gatttcaagt ggtgtgtatc ctgataccat 
1201 tactatccgc aatatgctga ctggtttatg gagtaaagag gaactaaaaa gggcactggc 
1261 aatgcttgag gaactgcaga tgagtatggt atgtaagttt ctgttcagtc tatgttattt 
1321 ttaatatgaa gaagaatgta tacatgcttt tgtgtgtagc ttcagattga tgatacatgt 
1381 tctggaatta accatcggtt tggttttgca ttgtaggatc atcatttggt ggggtgaatg 
1441 atcaaagatt tttctactct ttgcagcaga gcttcaatgc attttgtttt tgttgctgca 
1501 tttgtaccct actaatgttt gatcaaatcg tggaatagag tgatcatagt gtaaaaattg 
1561 tgtggtcaat gagctgtttt gctgctattc taatgacagc ctttatgcgt ctattgtagt 
1621 ttaataaatt tgaccatttc caattaagtt ccatacactt gttcacgcaa gattattggt 
1681 gcgaaataaa gaagcacacc ttccagaaga cttcaggtgt taaaagatgt ttagtgttta 
1741 atctttcgat cccacttttt gtttgtttgt atgcaggtat attagttggt aacatggaaa 
1801 atattctcct ttattacctt gaagtagtga tattgtctgg tttagttaaa ctcgtattat 
1861 cctaactggt agtatccatg agattcaatt ggtatgtgcc tgtctacaac ataggagttc 
1921 ttctctatga tcattctatt tcaaaatatt tatatgtttc aggtgtgagg caggtcaatt 
1981 agtggcaagg aagcttatcc tcaacttggt tcataatttt ctgcagaaag aaacattggt 
2041 tctcagctca aaatatgtta tcatcaacca ggtaatcgta tttaagacaa tccaacacac 
2101 cctgcttcac atgcaccttt caaaatttta agagatacaa ggtgataacg aattttattt 
2161 ataccgagag agtagactac atgtcgaacg tgtttaatga taagcgcaaa cgagaaaaaa 
2221 acaaccttca agaacgagta atgtgctttg gcatgcactt attaagagaa tagattaatt 
2281 gtatacagaa gccaaaatga tttgttcttg aagctacgaa tcaatctgta tgttcagatt 
2341 ctcgacgatc tcttctagct tgagttttcc ctttcctttc cctccacccc ctcccaggtg 
2401 ccttcatcaa aggtttctcc acatcgcctg agatatcaag atgaactcct ggatcatgag- 
2461 tctcgtactt tgcgcccatc attatggagt tgttcacagc ttgaaagttg ctgtttacaa 
2521 aagtggtcag gaaatcaagc tcttgcccac cgtttttgta gctgtctcca tggttatcat 
2581 caagctcggt cttcatggtg gctccaaggt tggatccaga gagcgtgatg gctctgatgc 
2641 catcctcctc ctccttctcc tcttcttcat attggttccg tcctgccttg tctgctccca 
2701 tgtgggttaa ggaggagatc atgtttttaa gatcgtggtg gaaatgttgg tgttggttgt 
2761 ccttcgctga ttcgcctcct cttgatttgc ttagcttcat tctaagtttt gcacaaccag 
2821 aagagagtat gggaaggcaa gtaatgcaaa ccatacatct ctaaatatat atatttataa 
2881 atatcactat atcagagtaa acgcaatgcc ttgtgttgta agtaaaggaa ggggaatgac 
2941 tttggaatgg gtttaagaat aacctttgga gattgagaaa tgtggttagg tatcacgata 
3001 agatgttgca gttaaataat tggatgatgt caaaaccgta ggagattcag agttctttca 
3061 ttgtaataat aaaaacaaca cacttgagac catgtcgata gttctcttcc aacataaaaa 
3121 ccaattggtt ataaaagatc gtcatcacat catagatgag ttaaatcttg acccgaaggt 
3181 tgaaatcaag tcttcttttt ccagctagaa gctttgttag gacttgcata catataacaa 
3241 ttagatatgg ggaaagggaa cacccttgtc tatgacatct ttcgctatta acatacctag 
3301 tcaatttccc catttatcta aaccgagaaa gatgtcagct caatacactt tttttatcca 
3361 aacacagagg agacaactcg cattgctctg ttttccttca cacatgcatc tatgtagcaa 
3421 acaacaaaac tgaaatgaac tcttcctgat ccattcccaa atcctttatg gttttgacgt 
3481 gtattgaaac ctctgatact ccacaagtac atcaaattct ttttgttcaa aaatactagg 
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3541 acctttattc ttagactttt gaaaattttg 
3601 gacccctcta ggaagagatt gtctttgagt 
3661 ttctctgcct cttcaggaac atctacacaa 
3721 cagttgaact tagaacggag aatattgagt 
3781 ttccttggtg tgttttctat tcactcttgt 
3841 caatcacttc cacggaagca caagcctcag 
3901 acaacctcag aatcagtctc tgcgtttttt 
3961 tttagtttgg tgtttgcaag acgtaatttt 
4021 gtttcattat cacccttttc cttctcactc 
4081 tgtttctcgc tagtagctct acatgctttt 
4141 cctctacatt cacccaccaa taggaattag 
4201 gaaaccaaat aataagaatt taatgaaata 
4261 tagcaattat aaaaaaaaat tttaatattg 
4321 accctaaatt ctaaacccta aaccttaaat 
4 381 aagcctgaac ccttgggcaa atcctatacc 
4441 tcataaacta aatcctaatc cctaaaccct 
4501 tcctaaattc caaaatggtt tatggtttag 
4561 aagagtttag ggtttagggg ttcgtgttta 
4621 ttaagggttt aggttttaag attaacgatt 
4681 ctttccccaa gggtttaggg tttagtattt 
4741 ggtttagtat tttctgacgg atttatcaat 
4801 ttttttattt atctaaaaat ataattttat 
4861 ttttaagaga tatcaaatct taattaacta 
4921 accttagggg gtgaagctaa gttttgttct 
4981 atgcttgaac cgatcgagat aatttttttc 
5041 tcatatatat tttttaaaat tctaaacccc 
5101 aatttaaatt agttaatctt atgtgtatag 
5161 ttatgatcct tttatactat atatttgtga 
5221 attgatcata cttttatgac tgaaaaataa 
5281 ccacgattag aaaacataat ctctccgttt 
5341 ttgttttaaa aaagtgtatt ttttatattt 
5401 aaattgtgtg ttttaaaaac attaattata 
5461 aggaaatata gaattacaaa aaactatgta 
5521 ataagtgtga aaatcctaaa atatctatta 
5581 tttgatcatt tttttttgtt tttttttttt 
5641 aaaaggaaaa gtaaaacaac aaaacagtta 
5701 agccggtagc caataacacg acggttaaaa 
5761 gacgtaattt acacgatcga gtaataagtt 
5821 gtagtaaaac aagtccatta ttatatcatc 
5881 cgcagacatg catcttcttt ccattttctt 
5941 aaacgaaatt aagagaacaa acacaaaaag 
6001 agatcaacaa aaacaaagta gactaagaaa 
6061 agtatccaaa aaaatatcat aacaatgaca 
6121 tctctgggct cctactttaa aacgtaccgt 
6181 ttgacgcttc tcctcttcct tctcgcagcc 
6241 agtaaggtat aattaattaa gaaaaataat 
6301 taatcttttt ttgttgatca aagacgacga 
6361 gaaaaccggt acccgagcta gaatcaccga 
6421 gtttattgaa tcaaaaccgg tctggttctt 
6481 taattattat cgagatcttt tcattacgta 
6541 tcaacttttt gttgtttacc acagacaaaa 
6601 aaaccggttc acgtaccagt accagtacca 
6661 tcggtagatt gcacaagttt cttgaaccaa 
6721 caacccggct ataataataa ccaaaccgaa 
6781 tggatccacg aggatctaaa gccatggaga 
6841 agaggacaaa cgacagcgca tttcaggtta 
6901 aactacaaga agtctataca gactagagac 
6961 ctgagaaagt atccagggaa gttgcctgac 
7021 cctgttatta gattggacgg ttacagtaaa 
7081 gcgttattta gatacggcgg agatagatgg 
7141 ttctggggat ggtatgtact agaatatatc 



attctgatag aaacttatgg tttatcctta 
aatcacctct ttctctttag tttatcttat 
gatatgtctc cactatttca ccttgctttt 
tcaatagtat ggttaggaca tctgctacct 
tccataaatt tggatcttct taagtttttg 
taacctcaca aaaaaaaaat ctccacttct 
tatgatactc gtgcttcctg tatccttttc 
ctcattaatg tttgttaccc actgagaaag 
aagtcttctt tctcaacttc cttttccaca 
atcaagaaca aaactttacc ctaaagtgaa 
ttaattgaga tttgatatct tttaaaaaat 
aaattatgtt tttagataaa taaaaattag 
ataaatccgt cagaaaatac taaaccctaa 
tctaaatact aaaccctaaa ccctaggaaa 
ctaaccctta aatttaaacc aaaccttaaa 
aaattataac ccctaatact aaaccctaaa 
gattaaggat ttatgattta atatttgtca 
gaatttaggg tttaatatta ttgtttatga 
taggatatag gatttgccaa aaggttcagg 
agaattttgg gtttagggtt tagggtbtag 
attaaaaaaa aaaaattata attgctacta 
ttaattaaat ttttattact tggtttcctt 
atttctattg gtgggtgaat gtagagattc 
tttatcaaat gaccccagtt accgtatgtt 
taaaattgtt atttttaatt ttaaatctaa 
tgaaaaaaaa aactccaccc ccttttccaa 
acatctttta ccttgttagt atactagtca 
taaacaatca ttttttggct ttactagaat 
ataaaaacaa atattatata atgggctaca 
cataaatatg tatgttttag gaaaaaaata 
tcagtgcagt tcttatcaac aaataataaa 
tttttaaaaa ttttatttgt ttaaaaatat 
ttaataacta agttttcata tatttttttg 
ttttgaaaca aaatgagtat atttttcctt 
ttggttttct caaacataca gctggaaagc 
aagcctaaaa cccacaagtg ggctctgttt 
tgtcgtatat gattatcata atacgaaact 
tagtagatga aattccaaaa ttcaaagagt 
atcatcaatg tcgtgtcctt gtcaaactca 
ttcctgctta aattttcttt aacttcaaat 
agagactgga agcaagagag agagagagag 
agattcctta gtctccaaag attccccctt 
tctctcttct ccaaagcccg agctctcact 
ccggtcaccg gaaaactcca tgtggccacc 
gctgttgccg tcacctcttc tctatggctt 
cttaatataa cgttatcaag aaaattaaca 
aacaatttga tacaccgaca ttagtcacaa 
agaaaaccgg agtactggta aatggcacaa 
actcagaaac atctctatgg cttaacaagg 
attaaaattc taatactatt ttcagtaatt 
tcatataatc agccgacaat aataacaaca 
gagaagaaat caacgaagaa aaccggaatc 
aaccggtccg gttcttgctc gagaacagct 
tcgaaccggt catgtcctga ttacttcaag 
gagacgggga taacgagaga aatggtggag 
gttatagtaa acggcaaagt gttcgtcgaa 
gagttcacac tgtgggggat tcttcagctg 
gtggatctca tgttcgactg tgatgatcgg 
tctaatcata cagctgaaaa tgcaccacct 
acggcggatg tcgtctttcc agactggtca 
tctacacttt ccctcctttt cttcttttgt 
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7201 tttctctgtc tgaagcgtgt agggttgtca acatgcaatt gtccccactt ttgatcacat 
7261 gccgcaaaaa aagattcatt tttttgttta tatatagtaa tttaaagggt tcgaaaaagt 
7321 agaattaaat gattgagtac tctccactca atgtcactac tagttaaata tgaggttgta 
7381 gctaatttat ccctattttt cgttccaaaa aaagctaatt tatttatgct ttgaataagt 
7441 gattagctaa aagtaacttt aatttttata tcgcgaacgg aactttaatt tttatataat 
7501 tcaatcttta cagcaaaaca ttaagtaagt gttttttttt ttgataaaag atttaacatt 
7561 aagtaagtgt tttgttccaa aatcttattg gtaaattcat attgaacttt gaaatattgt 
7621 tttgtatcaa ttgtcatatt tgtttttctg atgatttatt tgcatagttg aaaattttaa 
7681 taagttgaaa atcaaataaa ctgaaataaa ttggattaaa ttggtggata aaggcaagag 
* 7741 attaacataa agccatggag caaagtgttg acagaaatgg aaaaaggaaa gaagaagaag 
7801 aaatttatgg agagagaagc ttatgcatat tggaaaggga acccttttgt tgcatctcct 
7861 tcgagagaag atcttcttac ttgcaatgta tcctcacaac atgattggaa tgctagaatt 
7921 ttcattcagg tatatatata tatatatata tatatgcttc tataagctta tgttcgtacg 
7981 tataacccta ttttatctat taatttttcc atgaggtata ataacaatgt tgataaaaat 
8041 aggtataata acaatgaatt tgtgagcaat attttctttg tgtttacaca ggattggata 
8101 tcagaaggac aaaaaggatt tgagaattca aatgtagcag atcaatgcac ttacaagtta 
8161 gtactagtct atacattttg cctcatatag taccaatgtc taatttgagt tgaggttttt 
8221 tatatatatg ttgattttaa ttattgagat aataagttgt gcaggtacaa gatatatata 
8281 gaagggtatg gatggtcagt gagtgagaaa tacatattgg catgcgactc agttacattg 
8341 atggtgaaac catattacta tgatttcttc tcaagaactc ttcaacctct ccaacactat 
8401 tggcccatta atgataagga taaatgtaga tccatcaaat ttgctgttga ctggcttaat 
8461 aatcacactc aaaaggtatc attaccaatc ttttttttat ctcagtctag ataatctttc 
8521 ttacatgatc aattttccct ccaccaaact acaccttgaa taggatttta gcttaagtaa 
8581 aaaaaagtag atcccctagt ttacaaaaag taatttatgt atatatatag tactcgagta 
8641 attagtttca gttttttttt ttttgcgaaa taattagttt taattttgta aggaaagttt 
8701 ttcattttat ctttcgaata taatctacaa aatagttgtc tatacctata agttaatact 
8761 gtgattagat aaaatcttgc tatttttctt tattgttgaa ataaagtacc aaaaacacat 
8821 gttgcgtaat actttgaaaa gttggatcgc ataattatgc atactttttg tttataattg 
8881 gttgcaatgc aggctcaaga gattggaagg ggagcaagtg agttcatgca acgagatcta 
8941 tcaatggaaa acgtgtatga ttacatgttc catttgttga atgaatactc aaagcttctt 
9001 aagttcaagc ctcaagttcc ccaaaacagt gttgaaatct gcacagaagc aatggtgtgc 
9061 ccttctggag atgctaatga tactaataag agatttttga tgggctcttt agtcgatgag 
9121 cctcacaatt caggcccatg ttcgctacct cctccttttg atcccaacgg tctcgagaag 
9181 ttttatagga agaaactgaa tctcatccgg caagttgaga aatgggagca cgcttactgg 
9241 gaaaacgttc aataaaatga . ttgtattaaa ctgtttgata tgattttatt ttagctttta 
9301 tatatcggat cacgttagaa taatgataat tttgagaaat atataattcc ttcatcagca 
9361 cacagatggt ccacataata acatatgcag cttcttagca tgagtagatc aatttcctca 
9421 acctaaatcg agcccattga aacctgctga actaatagat cgtgttaata gcttttattt 
9481 atatgtagta cattatatta tatctaccat aattaaggtg aatgactgtg tgagcttgaa 
9541 gcttcttatt ttattttagc acggctcatt tacatgtgta ttttatttgt aaaagactgg 
9601 atacatgttt tgctatataa tagttcatgt atttgaatgt tcttgaagat taaaaaggca 
9661 tgtgaggcta ctgcgatatg attgtagtgg gaacttaatg accaatttca tgtgaatgct 
9721 taaaagatat gggacagttg tagttaaaac taaaaagttt gttttcctgc cttgacgaat 
9781 gacgaataca aattaagttc atctattcta ttaattcttc aacatgtcct attgatattt 
9841 agttgtgtcc aactgatttt ttttataact gtatctaata atttaattaa atcaataaca 
9901 tgttctcttt gtataataag cagttacgtt ttcttccaac actaagaaag agtttttgtc 
9961 ggtttatgat tttaaaattg ggccatgggc ttcgatttcg ttttaaaaat caactgattt 
10021 gttacatttg gtctgattgc aatggccata taaatttata ttgctttctt aaccacattt 
10081 aggtttatgg ttaatttaat tttaatagaa gaagactaat aatattgatt attaaaacca 
10141 taattttata taagattact attatatcta taataattta tagatatttg tttactatta 
10201 tatctattag cataactgta tgtatatata tatatatata tatatatatt atttatagtt 
10261 tgcattgcaa catttttaat aaactaaaaa attagtatat tttaatttta aaataaatct 
10321 tcaatttgta ttcataaatt tatgattaga aagttaataa taaaatagta ataattaatt 
10381 tcagacatat ttacagaaaa aaatacaaaa caacactata tatgcgcttc aagcacagat 
10441 caagatctaa tataggtcta acattttacc ggacccgaag aaccaaacta gaacggatcc 
10501 gaaaatacag gttcagatcc gagtctatgc taaaatattt attggatctt ttttatgggg 
10561 acccacaagt ttcggtttgg atctacgtcc tacaagagac ccaatcgggt attcgaaata 
10621 ctcaaaaatt attatatatt aggtaaattt ggatgatttt gtgtattttg gatatttcag 
10681 atattttttt aagtttcagg ttaaggtttt taggtataat ttcaggtttc gggtaaattt 
10741 tagatttcta aaaaatataa tttgggtgtt cgggtaaaat ttgagatacc ttttgggttt 
10801 tcggatctga ttttgggtaa gtttcagata tattttctcc aggtattttt aggatttgca 
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10861 ggtacttttc gagttttcaa attcagtttg 
10921 atatgattac aaaaaataga tttataatta 
10981 aaattattat tatttataca gtaaatcaaa 
11041 tgactacaaa aaaatatgat .taagaaaaaa 
11101 acatgtccta ttgatattta gttgtgtcca 
11161 tttaattaaa tcaataacat gttctctttg 
11221 ctaagaaaga gtttttgtcg gtttatgatt 
11281 tttaaaaatc aactgatttg ttacatttgg 
11341 tgctttctta accacattta ggtttatggt 
11401 atattgatta ttaaaaccat aattttatat 
11461 agatatttgt ttactattat atctattagc 
11521 atatatatta tttatagttt gcattgcaac 
11581 ttaattttaa aataaatctt caatttgtat 
11641 aaaatagtaa taattaattt cagacatatt 
11701 atgcgcttca agcacagatc aagatctaat 
11761 accaaactag aacggatccg aaaatacagg 
11821 ttggatcttt tttatgggga cccacaagtt 
11881 caatcgggta ttcgaaatac tcaaaaatta 
11941 tgtattttgg atatttcaga tattttttta 
12001 tcaggtttcg ggtaaatttt agatttctaa 
12061 tgagatacct ttcgggtttt cggatctgat 
12121 ggtattttta ggatttgcag gtacttttcg 
12181 gcctaatctg aatagataaa tatgattaca 
12241 gtgtatgccg gccaacttta aattattatt 
12301 taagatttat atacatatat gactacaaaa 
12361 tacaattaaa tatctagaaa aagttaaaca 
12421 tcagaatcta gattctgaag accgaatgtt 
12481 aaataaaaaa tcatataaac tttttaaaat 
12541 gggtctatag caaaaaacaa tagtttaaaa 
12601 taaaatcact gtgaaataat tcacaagttt 
12661 acggataaga attatttgga gtttttaaaa 
12721 ttcatacaat aattttagat aaatttcata 
12781 tacaaaatat tcatgacata aacttaattg 
12841 gggcgggtca gaatctagtt caatgttaat 
12901 ttatttttgt ttttaaaata attatgcgtt 
12961 ttttagtttt ctacacacat aacttgccat 
13021 aaaaaatata tatatattta cgatggcgat 
13081 tttgctcaaa aaaaatgtat gtgaatacat 
13141 ttttaaaatg atatatgttt taaagtaaaa 
13201 aatattttat tagatagtga attgtatcaa 
13261 ttttgattgg ataaaaaatt tgtggaaagt 
13321 aaaattttta atgtggatga aaattcaaaa 
13381 tagacgaaca tgacgtatat ttattgtgtc 
13441 cattattaat gcatatcata atgttagttt 
13501 tcacgtgagt attattggcg gtgaatattt 
13561 atagtatgta aatttgtaag acttgataaa 
13621 gaataaccta gtttgttggc aaataaatat 
13681 tgagtgattc aaatttacta ctccttccgt 
13741 tatttttaaa agatctacat tttacatttt 
13801 ttcgaaaatt taattttact fcattgaattt 
13861 aagacaagat tattcaaatt taatgtgttt 
13921 taactttaga aaatagaaag aatatatcag 
13981 ttcaattgaa gcttaaccac tatgttcaat 
14041 gactggttca aacgcatcga ttgcggttac 
14101 tagtttcaag cgttatttag agttttgtat 
14161 ttgcgaaata tttgtaactg gttgattatg 
14221 caatataaac atattataat atataaaaat 
14281 tattattaat ataatatgat taataataat 
14341 ttgaaagtta taatttaatt taaaaattga 
14401 ttgtttcgtt ttatatatgt atatatcttt 
14461 atctattgaa tacctagttt aaagttttta 



gatatgtcta ggcctaatct gaatagataa 
taaagaaacg agtgtatgcc ggccaacttt 
tagattttga ttaagattta tatacatata 
aacacaagta tctattctat taattcttca 
actgattttt tttataactg tatctaataa 
tataataagc agttacgttt tcttccaaca 
ttaaaattgg gccatgggct tcgatttcgt 
tctgattgca atggccatat aaatttatat 
taatttaatt ttaatagaag aagactaata 
aagattacta ttatatctat aataatttat 
ataactgtat gtatatatat atatatatat 
atttttaata aactaaaaaa ttagtatatt 
tcataaattt atgattagaa agttaataat 
tacagaaaaa aatacaaaac aacactatat 
ataggtctaa cattttaccg gacccgaaga 
ttcagatccg agtctatgct aaaatattta 
tcggtttgga tctacgtcct acaagagacc 
ttatatatta ggtaaatttg gatgattttg 
agtttcaggt taaggttttt aggtataatt 
aaaatataat ttgggtgttc gggtaaaatt 
tttgggtaag tttcagatat attttctcca 
agttttcaaa ttcagtttgg atatgtctag 
aaaaatagat ttataattat aaagaaacga 
atttatacag taaatcaaat agattttgat 
aaatatgatt aagaaaaaaa acacaagtat 
aaaatataca tgtcattttg aaagggcggg 
ctcaaatatt aaaaaaaata gtttcattta 
acacaaaaac agattttatc aatacataac 
atattaaact ttttatttaa attaaaatac 
aacagcaggt taataaacaa aaatcataat 
ttttggtcat atttaaaata acaaaaataa 
aaattcttaa caaatataat ttatcaagga 
tttcaacaaa aaatatatcc gccctctgaa 
cttataagca tgaataatta tttgataaat 
ttatcatcta ctactttcta gtcacttgtc 
ttcaaaaata tttacgatgt taagttcaaa 
attttacaag gcatttggtt actcgttttg 
taaaacatca aaaataaata ccctctccgt 
gttgttttga aaaatatttt ttatattttc 
gagtttaaaa aactaattgt atttattaaa 
aattaatcac aaaataatat atttaaattc 
acctacattt taaaaatcga ttgagtgtat 
aaagagcgta gtaactgtta ttggttgtct 
gttaattttc attaaatggc cttttccttt 
ttaagtctga tttttctcat acaaaaggac 
atgtttttta aaagaaaaat gttaagaaca 
ttatccttaa acatattcca tactttgttt 
ttcttaaagt tagatatttt agaagaaaaa 
caatgcatgt tttattatct . aattgcaaat 
ttattggctt aaaattatgg aagaaaaata 
tattaaaatt tgtgaaaaat ctagaatatg 
aaaaaaaact aaacgtagat ctgacaattt 
atctttgacg aaaatgtctc gtaaaggqct 
gggtgcggga gtttgtggat gtggtggctg 
gattggcata atatttttga atttttacgt 
agatattaca acagtttaat aataaattat 
ataaaaacat gctattgtga taaaattaat 
gttgtattta ttttttaata ttttaattag 
agatttatat ttaaaaaaat aaagaagatt 
atatatgtat gcatattaat tttaaaaagt 
caaaacaaat tatgataccg tttgtgaatt 
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14521 tgaattaata tatataattt atatattttt atttatatta tttccatttt aattttaaat 
14581 tttaaaatgt ataaaaaata tttttttatc tatgcgcaac cgtaaacgct aactgcaacc 
14641 agcttttgat tttaaaaggt ttgaagcggt ttgaaacgat ttatagcggt ttatatgatt 
14701 gttttgaaac attgtcaacc gctatcaacc gcaaaaattg cgtttgcagg tggtagcaca 
14761 gaaaccagtg aagccctaaa tagtgtaaaa tatctaaaaa ttatacctta cttttatacc 
14821 tagtatgtaa gtttcctata taaatttggt gtcatcacaa agtttcatta tggtccttat 
14881 cgatttccct tagtgtatac tcaatgtttt gcttgtaagc atcgggtgtg acgacttgga 
14941 acctaattca tgtccaaaca gataagcata attattattt gtcattcgta tatgtatatt 
15001 attgtttagt taggaaatat ttaacgtata tcgttttagc taaaatcgtg atgaaagaaa 
15061 aggcactaaa aaccgatata gttaatttaa gattaaatac ataaataata gaaatatcaa 
15121 aagacaagag atttaacgca ttgacgtccg gtgtttttct ccggagtgga aagtcttttc 
15181 tcccactaca agaaaaaatg cttttattag cggacgaaaa acgctatcta acgatacgat 
15241 ggcggtttat gaagcgttgt atcatcggcc gtcataatag atcagacata ttagatagcg 
15301 tttttttgcc ctgctatcgt attgttatat attatagcgt tttctgattt gcaattaaat 
15361- attctttaat agcgtttttt tttgttatta tttaatattt atttgtttat tttaaaatta 
15421 tttaatttaa ttattaaaaa taaaaatata attaaatttt-agaattgatt"" tatatattta 
15481 aaatttcttt attatttaaa cctatttaca attaattggt ttacaaaaat tgaaaaaata 
15541 aaaaaaataa atctaattag ataaaaaaat actaaaccaa ataaaccaaa aacctaaacc 
15601 aaagcttaaa cctaattact aattaactaa cgtatgccgc cgcacaccac cacctacatc 
15661 gtcttcttca tcttccaatc ctcagccgcc tcaccaccac cttcgtcgtc ttctttaacg 
15721 cccttcgtcg tcgcatcccg ccatgattct tcttcttcgt tgtgggattt gatagtcaag 
15781 ttgcagagag gaggcttact cgtcgtcgca tccagccacc ttcgtcgtca ttgtctcctc 
15841 tccacgaggc ttgctcgccg tcttcccggt . cgtcttctcc aagtatcttg agtgacaatc 
15901 gcagcagcag agttcagagt cgaaggcgcc agaggcgaag tcgaaggtgc cagaggcaga 
15961 gacaaaggcg cattgctaaa tcaaactcaa catccgaact ttaaaaaatt agttgagaag 
16021 aagataaaag atatagcaga gaagactcca aagggaagaa gaaagacaga gagagtagat 
16081 ctgtttctcg tagagaagaa aagaaaaaaa aaatgaaaca aagaagagaa gtacaaatga 
16141 attgttgtcg ttagatcaaa actaatctta gggtttgtat tgcaatccac gtgattcatg 
16201 aattgaccaa tagacaaact tctttatttt tattttttac ttttttttcg acaacctttt 
16261 gtgtttttta ataacaatat ttttattttt tgtttaaggt tcgtgtacgt ataataagta 
16321 ataactatag attgatttaa aaggttggag tttgatttat gatttagagt tttattacag 
16381 tttaatgttt ataatataaa ttttgtggta tggggtttag ggtttatagt ttatggtaag 
16441 atttaatatt aaataatgaa aataatttgg tgtttaaatt ttcttctaat gcatatagtt 
16501 tgattgaaga tttaaggtct gtgtgaatat ctaccgaatt tatatttata aaatgaatga 
16561 tatataactt tgacaaagtg actacatttc atttttagtg aaaatttata aatgcatttt 
16621 gatgcattat ataactttga aaagtgttga taaagtttgg atctaatttt tgaagcttat 
16681 aatagaagta ggaatatagg gtttgagggt taaatttttg ttttaagagt ttagggtttg 
16741 aaaacttgct tatagtttag agatttaaat ttaagatctg atattcaaat aagaacaatt 
16801 ttgagtttta ggggttgtgt tttaaaaaat tgggttttag ggtttgaaat ttagggtata 
16861 agatttagat ttttagaagc aaatgaaatg gagtaggagg gaaagtgaaa aaagaatgag 
16921 agagaaaaac aattatattt ttgccttagt tatacatagc gttttgaaat tgatagtgct 
16981 atgtataatg catttataaa aagagtttgt agcaataata acggaccaac caaacatagc 
17041 gtttccaaaa atacaaacgc tatctaaaac taacggatat gtcaaccata gcagaaagcg 
17101 aaaaaacgct atcctggtct gctattaaaa cgcatttttc ttgtagtgtc ctcttttcta 
17161 tttcattttc cttttgccct tttggtggat cttggccttc ccgacgatat gcatggtgat 
17221 ttatggagtt tggtggaggt tcatgatgta gatatagccg tggatcagca agttgtgtga 
17281 agtggatcca ccgggatggg tcagcattta ggttgttgtg gctgtaccgt caaacgtttt 
17341 catttttcgt ttgacagtag catacatcaa gacagtaccg gtactaagtt ttatcgggcc 
17401 cattcaaaaa taacacatct atatatatct ttttagtaca caatttagtt tattgtaata 
17461 attatatatt tttaaaaact aatattgatt aaattgactt ttatttgttt tagtatatat 
17521 atttcattaa cttaattttt attaatagaa aatttattag tttgatatgt tttgatattt 
17581 ttctaattaa atgtataaaa atatcaattt catataaaaa atttgcaact taaaaaaaaa 
17641 atgaaaacca caaattggaa ccatgccaat atttcatggg tttttagtca agcctggcac 
17701 tgcatcgaga tgaaggcgta tagtttcgat ttcgtatctc ctctatcgac cgagacgtgg 
17761 tttattagtg atggacctaa ctcacaccca aaagctagct caagagtgga ggattgtcca 
17821 atcacatata tattgcccaa ggatcacatc attaaacgat gtgggacatt ctaatagccc 
17881 ctctcgagat gtgggtggga aacggtccaa cacggaaaca gcccaagccc aacatctcgg 
17941 acataccaca gcaagatggg ccatttttat aggctacata agtagacaac tattttatat 
18001 gggaaatcat ctgctgggct taaccatagg ctctgatacc atattagtga tgggcctaac 
18061 tcacacccaa aagctagctt aatagtggag gattgcccaa tcacatgtat attgcccaag 
18121 gatcacatca ttaaacgatg tgggacattc taatatggtt caagattgaa attagtgggg 
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18181 cgtgttaggg cgtgaagagc aggctcggca 
18241 tgtgaatgac gaaggctttg tttccttgtt 
18301 aatggtgtta acttgattgg cttgtgagtt 
18361 gagtcgtcta taactcaacc ggaaattgga 
18421 gttccgtcaa caacgtgatg gtggctttag 
18481 gacgtgtgaa atttcaattt ctcttggtca 
18541 cattggagat atataatttg cagccgacaa 
18601 ttctgttatg aggtttgtgc tagagcgttt 
18661 ggatgtctgg ccaatcaagt ggaatcatgt 
18721 ttgctgacat ttataaggag aaaagtctta 
18781 aatatatatg cacctcttgg tgttgtaatt 
18841 ataccatctt cttcgggcta caaatggagc 
18901 atatatcttt ataaccttat tttagaggta 
18961 tgaatcatgg aggtctctca tctcactgtg 
19021 atggatctga gtgtgtgtat acatccacta 
19081 gtgtatgtfcfc aaacattgat atatccaaca 
19141 acatatggaa aatcaaatgg tgatagtata 
19201 cacatcagtt ttgtgttcaa tttct tttaa 
19261 gactttgaat tttgctataa gttgaatttt 
19321 tagacgatcg atctactcca aaacattagt 
19381 tatgataacc aatcttttat ataacactaa 
19441 cgacatagta tataaaaaaa aaatacacgt 
19501 ctcaatttta tgaagttgcg atcagttaag 
19561 atatatatat atatatatat atattgttat 
19621 ttaatacttt ttggaaacat aaataacatg 
19681 gaagattatg tattttcttt tctacacatt 
19741 ttctttttat atctcttttt atatttattt 
19801 agcagattaa gtacttcttt ttaagttttt 
19861 aattaaactg taattaatta attaatttaa 
19921 ttggtgataa aaacaaaaac taattttttt 
19981 ataatattat cttatttaat tttgtcaaat 
20041 aaaacaaaat atcataaaac taataataat 
20101 aattagtata atgttttatt ttaaaataaa 
20161 taatacactt atatcaaatt gcatcaaatt 
20221 aaaaaaatta ttaacaaaaa tctatattat 
20281 aaacataaaa atatatatga ttttttataa 
20341 aaataaagaa acgtagaatg tactagaaat 
20401 ttttatagta tgacatattg taccaaacgt 
20461 attaataaaa aataataatt atttaattag 
20521 aattatttat tttccaagct ttgttgtgtt 
20581 tatttaagtt tgattttttt tttttgaaca 
20641 acagagttag agagacaaga tagtgcagcc 
20701 ttacgagaaa caaactcaaa cgaaacagaa 
20761 aaaagaactc cttgtgcttc aacaatcttc 
20821 ttggaatctg atcgaatgac cagcttacgc 
20881 gctttaaccg caagagcttc tgcaattaga 
20941 gagcgatgga aggtcttgtt gctcatatct 
21001 gtgccagcat tccacgctgc atctacaaaa 
21061 tgacgagcct agagaaagtt atctgatggt 
21121 tgggcggttt gccaaatcct tgcttccttc 
21181 ccctctcggt tctcaaaaat caacatgttt 
21241 ggatagagat cagtttcacc aaggcccagt 
213 01 tgcaaaagag cttccgggga ggtgatgatc 
21361. agatcccaaa cacgacgggc aaatgggcat 
21421 aatccacatc tcttacaacg cccttccact 
21481 actggcaaat ctttactttt gatcttccat 
21541 atttgccata tacagaggtt ccagttgaag 
21601 cagtatgggt agcccgtctt ggttgagtac 
21661 tcatccttca tgtggaactc gctaggaact 
21721 aagtgtagtc taatcgcagg cagatcccat 
21781 actttcagct ctttgttctg ctctgtaggt 



cgtgggtggc tccctatctt caacctgcct 
gacaatacat aaccagagtt taaggaggct 
acgttctgtt gagcttactt tagtcgtgga 
aaaacaacac ttcaacagtg tgagatcgaa 
gaatagttgc actcttgttc tcagatgcag 
atgagggagc ttaaaccttg gtaaggctta 
aattgtttta tccaatagat ttgtctcttc 
gttgaagcct caatctcaac ccaactttca 
tgcttttggc tacatattcc aatataatgg 
gtccctggct aaccgagacc attgaatgag 
ggaggtcatt tttagcaatc tgatcattcg 
ttttttgttt tgtaatagtg gaaggtttag 
aaattaggta gtttaagagg tattttgctt 
gttaaaaata cagcttgtac tctaggttgt 
aatccttatt ctgaatttgt ccaacattat 
cgagaaattt'tagaaataaa caaatggaac 
gtgataattt tttggtgttt gatatgtatt 
aaactaaatt atcgttgttt ggtaagtttg 
ggtataagtg attaacaagg tggatagtaa 
cggaaaacaa ttcaatcttg gattagacag 
atatgttttt ctgagattaa ctagattaac 
atgagtaatt tgtcccaatc ttgtttgttg 
gaaggccatt tatataattt agatatatat 
aaaatttata ttattaaaaa tgaagtaact 
ataaataaga attgtataga aacatgtata 
tagccattgc attataataa attaagtact 
ttatttacat attctttttg tatctttttt 
ttttattttt gtatttcaat ttaatttatt 
attaattaat ttcaatgctt attgtttttt 
agtatatatt atataaaaaa agtgttaaat 
ataaaaattc cgagaattac aattttcaag 
tcatagaaaa ctaatgcaaa aaaataaaaa 
ttaacaaaaa ataacaaatt aatatatgaa 
ataattttaa atataatatt atgcaaaaat 
aaaataatat attttactct agatgtaaca 
aaatagcgat ttattaaatt agttgtaatt 
aaacgtgggg gtttgttgga tccatctaaa 
tggacatata tgttctgtaa ctttgttgac 
ttgttgttgc ttcttttaga tcatttgatt 
catttgcata atatttttat ctactattct 
aaccttcgtt acttcaaata ctcaaactaa 
ttagccacat catctgcctc ctcattgtac 
ttaaaaagag tacaaaggtg atcaatatca 
tctttggtgt taatggccat gataagggac 
aaacccagag accgtgccgc ttttaagcca 
gcagagccga cgatgctgcg- gtttgatgaa 
ttgaagatgc acccaaatcc tccaccactg 
cattgagcct ccacatccaa gttagaaacc 
gtagataggt tctgtttcgc tttctcttgt 
agtgctaagg ataccagttc ttgctctgat 
ctccctatcc acaagtacca gaagatccaa 
ggtggcagat taactatgcg tcgcgaggtt 
gctgggtctg gtttgaacat gacaggcacc 
tgtagaaaaa catgacgctc tgtttctatc 
tgaatccctc tgtgtaaaag atttgcacca 
aggaaatgtt tcagcttcgg ggaagttcta 
tcagfccagct cttcaccacg atttgtcttc 
tctccagaac gtgtatggag ccagcagagt 
agtttcaaaa .tgtgttcctc atactgaggt 
tcctttgtaa tgccattcaa aaggtccgat 
ggacccattg gacacattgg agtctttggg 



95 



WO 03/006622 



PCT/US02/222J7 



21841 gagagccaag gatcagacca ggcgttgatg 
21901 ccttggcgta acacctctct gcccgctata 
21961 ggagctggag ttgagaggaa tggggtggag 
22021 agcaaggact ctggaaactt cagtattcgc 
22081 aacttctcaa tctcccgaca tcccaaaccg 
22141 caagcaaccc aagacatttt acgatgatct 
22201 acagtttgta tctgtttgca gagtgactta 
22261 ggcatagcag agagaataga cttcaagagg 
22321 gaggtccagc tatgcgactt ttgcctgatc 
22381 ttctttctgt tgaagtgctc tgggaggcca 
22441 atgttaagct ccgctttgac cctgcgtctt 
22501 atagcagact tcagacaatt gatgctttgg 
22561 ttctttagtt ctttgcaaga tatcgggtca 
22621 aagaggagat gatttagagg tggtgaattg 
22681 tgttcaaacg ccttagagca gagtcccgat 
22741 ggagacagcg ggtcgccttg tcggatccca 
22801 ccattgatga ggtatgagta cgaaaccgag 
22861 gtagggtgga agccaaatcg gaccatgaca 
22921 taggccttac tcatgtccgt tttgacggcc 
22981 cagaggtaat gaaggatctc gtgagttatc 
23 041 aggctgattg gtgtggagag atcaatgaag 
23101 ccttcgcgat gaccttatag tgtgtagagc 
23161 tcttgggcgt tttgattttt ggtattaacc 
23221 gttgacccga gataaagaaa tctctgacct 
23281 aaaatccctg atagaaactg gccgaaaaac 
23341 tagcgaacac tgctcgcttg atctccaaat 
23401 tttcagggga gattctgggt gagatagcat 
23 4 61 ccgatgtgaa cagggtatca tagtactctt 
23521 acacagcagt accatcttgt ttctcaagaa 
23581 tagtaatggc atgaaagtag ctggaattat 
23641 tacgttgacg ccaatacttc tcctcctcct 
23701 tgccaatgat atctgtatca ggagtactgc 
23761 ggctctggat agactccttg ctattcagat 
23821 ggcgtactcg acaaagcctc gaccatttaa 
23881 tcttttagaa catgttcagc tgagttttgt 
23941 ttttactttc taaaagagtg ttaattgttc 
24001 aacatgttta tttaatcaaa tcgcaagttc 
24061 ttctgaagca acttaaaaca ataatatttg 
24121 aaactagatt ttttggtgtt agtgttgaaa 
24181 tgcagatata gcaaatttca tcagtaagag 
24241 atcattttta gagaaaattt tctacgaaga 
24301 ctctatagat ttagtgttta gatttgagga 
24361 gtttagagat ttttttaaaa atataataaa 
24421 tgaatttcca aaagataaaa aatttcgaat 
24481 tttgtcaact attttgtttg tttttgttta 
24541 gtaaggatat aagagttttt ggctttttaa 
24601 tagatttttt aataaagaaa cctattttat 
24661 aggagagtat acaacaaagc acaactctta 
24721 atccttggac gacccctcac tgaatcgact 
24781 gaatctacat ggttatatat ccaaaaataa 
24841 caatcaatat aaacctccaa gcataagctc 
24901 tcttttaccc tcactataat agtggagtag 
24961 gtattgaatg ttgaccaaaa aaaaaaaagt 
25021 aaaaaagttt tatgtaaaag aagagtatat 
25081 cggtgaacag attttattta ttccttttta 
25141 gttgaccaaa aaaaaaaatc cacttatcat 
25201 ttgcatgtat atgttcaacg actgattaaa 
25261 cggaagtact gtatgtccca ccataccaat 
25321 accacagaga tgttcttaat gattcaagtt 
25381 ttagcaacaa tactatactc tatatttaag 
25441 cactattctt tttttttttg tgggcaaaat 



tccgatccgt taccaataac ccatcctagg 
actccacgcc aaccatgaga agccgattta 
tgacagtact tccctgccag tgtttgagcc 
caggccaact ttgctaacag agcatcatta 
ccagctctct taggtagagt aagagtactc 
ggtttatcgt cccaccaaaa gcgagtgagc 
ggcaacttga agcacgtcat cgcataggtg 
accaacttcc ctgcgccatt gaggtaccta 
cgatcagtaa tgctagcaaa tatgtctctc 
aggtatttac ctataccccc ctcacccaca 
gtttcaatgg gtgttttggc agagaaggtt 
ccagaagcaa. cctcatattt ctcaaagatg 
ctcttgcaaa agaacatcgt atcatcggcg 
cgcgctacct tgatgccaca cagctttcct 
aagacctjcag agcacaaaat aaataaatag 
cgcgacggta caacacaacc gtgagcgccc 
ctaacacatt ccataatcca atgcacccat 
tctctcaaga aactccactc cagtcgatca 
atggtgcaat gcttctttgc tgatgaggta 
agaacattga aatagaacga ccagccacaa 
ggaggattgg tttcagcctt ctgcacatga 
agagtgctat gggcctgtaa tccgttactg 
ggacgtgagt ctcgttaaac cgacgatgaa 
cccgacagac ttcttctcct atgatgtccc 
cgtcgggtcc cggagctttg tctctatgaa 
catctggtat acttgtgagt gttgcattca 
cctctatgat ctgagatttc ccttgagacg 
caatagaggc tgctatctgt tcttctgcaa 
cagagaattt attggcagcc ttgcggcttc 
ggtctccgtg ttgcagccag aggattctac 
cgtaggcttt gtcaagctcg tttgtcaaca 
ttgatagcgc ttcctccaac gccttttgga 
tttgttctct agaccatttg ataatagcaa 
gtttgattta ttttcatgaa accactgatt 
agcttttcta agattcacac ctctttcttt 
atatgtatct actttttcac gactcaatgt 
aatattacac aaaaagccaa tgtaataaat 
ttttggatat gataatctaa ctagcttgga 
acttcatatg cgatttgatg acgatgtttt 
atcattctaa tattggcaag tcaatcatgc 
ccctaaattt gttattatca taaatatagt 
gtgaagtttc ttttttagga aaggagtagg 
aaaattttaa aatagtttca aatgaatttt 
tcgaaaacat ataattcgaa aactataatt 
aataattatt tatatctata tatctattaa 
taaaaccttt ttaattcatt tttctccttg 
aattatttga gacaactgcc ccttacaaaa 
cgagtcaaaa ctgaaagcta cttggtaaat 
ttgtctccat ttgatgcatt ttcctattat 
tagttgaaac aatatgtttg tgacagatct 
gcacgattgg tggcgtaaat tctgacggac 
tattatatat aggaacacaa ctcattgcaa 
cttctaataa tttatgcaaa ctttgattaa 
ttaaattact cttttacagt ttttttctca 
taaaaattac gatataaatc tacttgtcat 
tagatcgtaa aaacgctcta gcgttgcaac 
atacaattca tagatgacat atacggacta 
agctagccta acaggagcct agatcaccgt 
aaatcattta acgcgctatt acttcaatga 
aatcacccaa cgaaatttga taatttcaaa 
ttccaacact attctagata gactttttgc 
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25501 gattggaatc aaaagctagc gtgtatatat 
25561 aatcatttct aaaaacaatt tgtactcgta 
25621 tagtgatttg ccattatgta taaaataaaa 
25681 agaatacaat tgtagatatc -tgaatggtag 
25741 ggatatatcc tgatcagcac tagaattatt 
25801 tatatgaaat tgaacttata aaaaaatgtt 
25861 ttattatgct attattttaa tttatatttt 
25921 aaagacctgt ctttgaatcc aagtctctct 
25981 ctgctaattc tccaacttac ttttttcccc 
26041 atattagcta aatcctctaa tctctaccat 
26101 tcaaacatct ctgtttcttc agagtagtac 
26161 aacttgagag tgaatctcca gaaagaaaga 
26221 ttctacggct gtctcttctt cttggcctct 
26281 acgaaggtat gtgaagctta tttgcttaat 
26341 ttttctccaa ccttagtcta gacatctcaa 
26401 tattttcccg agaaaatcag tagaatctgt 
26461 tcatctctct aactctctgt gtacagttga 
26521 tgaagaccct ttgctcgtta tgtctaactg 
26581 gtccggcgtt acttgctctc cctctaaaga 
26641 tctgcaacaa catcacttct tttatgagtg 
26701 gtcgtgcaga aacatatctg cttcatctat 
26761 attaaccttc cttcaggaac tgtacgcttt 
26821 tatagtttga tgtttgtgat tttgacaatc 
26881 gaaccttcta ctcgggacaa taccaaagga 
26941 ggacttagga aacaatcatc tcatgggtcc 
27001 cattatcatc atgtaataaa aagactcaac 
27061 ataccttttc ttcataatga cgcttttgtt 
27121 gtttaacggg aaagctgcct tcagagattg 
27181 tcgacaggaa taggcttcag ggaagtcttc 
27241 aattgtgagt ttagcaaaat agtatgaaac 
27301 gggattttgg ttttactcaa ctctgttgca 
27361 ctggtctgtg caagtcttct cagttgaaag 
27421 gaaacattcc aaagtgtttg gagcactttc 
27481 ttcgagctct agacatctcc agttttgctt 
27541 ctaggacgag ctttcaaggg aactgcatgc 
27601 cccaatgcgg tgtgtaagat atagcttttt 
27661 gtttttacta aaatgtaata atccgtgggg 
27721 gaaagtccca gctccccacc aaagcaccag 
27781 acatcaagac ctaagtggct tcttgcgctt 
27841 ttctgcctgg ttgcactttt ctcagcagtt 
27901 attccttgga agaaatcttc aagcctaaag 
27961 tcatcatctt tccaaaaaaa tcttccaaca 
28021 gctttttgtt ctcttattac tccagattct 
28081 aggcaagagc ttgaggtggc gtgtgaagac 
28141 agtcaggtct ataaaggaac aatgaaaggt 
28201 gtcaaagaag aagattggac cggatacctc 
28261 gctcatgtta caaagaaaaa ctgagccttt 
28321 cagatttggc tagactgaac cacgagaacg 
28381 cctcaccgtt tgcaagaatg cttgtttttg 
28441 .acctccactg taaaaatctc aacttcctca 
28501 tgttttagac tatagattaa cattgctatg 
28561 ctttggtatc. atgggcaaga cggatgaaga 
28621 accttcatat ggaactggat cctccattta 
28681 atcttacaga agattttact cccaaggtaa 
28741 aacaatacag ttaagcttat gtttctatga 
288 01 cgattcttgc gagatcagag aagaatttga 
28861 tgctcccaaa cgcaatggag agtcgatatc 
28921 gtgttctttt gctggagatt gtgagtggaa 
28981 taattgaatg ggtaaagaag aagctattct 
29041 tcctttttct ttacaccccc cgttacatgt 
2 9101 cagaggcaat gacgagtttg gtggatccgg 



tcttgttatt tgtagtaaag ctctgaaaca 
aaaaatatga tacgatttta acagatacgg 
atagcaatta cgattgaaat ttcacgcaaa 
tacaaatcaa ttcctgattt ataaatattt 
ttttgtcaca taatttgtta ctattattat 
ataaattatt tcattgcaat atttatagat 
aaaagacatt ttctaccatc caatgaaaca 
ctctctctct caccccacac ttcaccatct 
ttttccaagt gagtttcctt tgataatagg 
ttcattctct ctctttgcta atcttggagc 
tccacgaaag ttgccgtctt taatgagtcg 
aaatgagatc gaactgttgt tctttacctc 
tcttcatctc ttgtgatgcc ttcgcctcta 
cccagatctc agattctttg gatcgaatcg 
tcgaccatga gaaaaaaaaa aagattacta 
caaagttaaa agcttttaat ttatacaaaa 
agctctaaga agattcaagg aagcaatcta 
gaacgacccc aattcacatc cttgcgcttg 
ccatgtcatc aagatgtaac actctcttta 
agatttggct tgttgatttt gatctttctt 
taaagggttt cttgcgcctg agttgtgtca 
ccatctctct ctctctctct tgacataata 
attgaatgct tctggtagag tcttacatgg 
gattggcaag ttgagtaatc tcaagatctt 
tatccccgct gaggtcggga gtttgtccag 
cttttacagc aaccaaaact gacggtttca 
tcacattttg cagaaacctt cagtccaatg 
gtaacttgaa gtaccttaaa gaacttcata 
tcgccgctgg atcatctggc tatccgtcaa 
acttctgtct gcaatggcaa atgttttaat 
ggttttcttc aaattcaagt gctaatactg 
tagctgactt ttcatacaac ttctttgtgg 
caaggtattc ttatcatcta ggttgtcttt 
gagagtttct cctttttttt atggtcaaaa 
aaaacaagga tcttaagcac agaccatctt 
tgttttcatt tcatggccca gctggtcttg 
gcaggtaacg cacagctggt caaaactcat 
tcagctgaaa ttgtggctaa gcatcataaa 
gagatagtca caggatcaat ggtcgctttg 
caccgctggc aaaacaggcc ttctctcatc 
gaaaagttcg cagtctacgt tggttagaaa 
ttccaattca aacaactcaa aataagtgtg 
gaaatgctca aggatgtctc gagattcaca 
ttcagcaaca tcattggttt atctgctgat 
ggacctgaga ttgctgtgat ctctctttgc 
gagctctatt tccagagaga ggtagcttct 
tctaagccat ttttgcactg tttcaggttg 
cagggaaact gctgggatac tgcaaagaga 
agtatgcatc aaacggaacg ctatacgagc 
tcagagcttt tgtgttgcat catactatac 
ataaatgttc tgatgcagat ggggaagcgg 
ttgtcatagg cattgcacgc ggtctcaagt 
caatctctga gctgagctca aacgcgatct 
aaagtaaagt gagaccagct ttgttatctg 
tgcagctggt tgattttgaa tgctggaaga 
gaaatattaa cagtgaaggt gcggtatgcg 
tggatgtgtc tggtaatata tatgcatttg 
gacctccttt ctgcaaagac agaggcttct 
ctctgaaaat accttaagct gtgagtaaac 
tgttgcaggc aaaggagtat cttgaaacac 
agctgaagca ttttaaccaa gaagagcttg 
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29161 aaacggtgtg tgaggtggcg aggcaatgct tgaacaggga tccgaacaac aacagcaaca 
29221 ataacaacaa gccatcagtg caggagttat gtgagacgtt ggagagtaga atcagtttgt 
29281 cgatttccgc agagcttaga tcatcatctt tggcttgggc tgagctcgcg cttgactctt 
29341 gatgaggtag aggagagtcc tgaatgatga tgaatggcta actggaaaca atgtaaatca 
29401 gtcgacacgt ttaaaagttg ttattgttct cttatcttga caagtgtgtt gagtttctta 
29461 cgtgttgctt tgattagtgt gtttacagtg tgtttgtaaa ggaagagata aacatagagt 
29521 ttgtaaaaag attttttaaa tttggaaatt ctatgagagg ttaaagaatg tgtgtgtagt 
29581 gtgaaatagt tttggaactt gtatgatgct atctttatgt tatacatacg gtgtcttcca 
29641 tattcattca gacaaagagt ttgctggttg aattgaaatt tcataaacgt cggattcaac 
29701 ttaataaaaa agtcttttga ctattttaaa aagccgccaa tttcatggct ggtacgactt 
29761 atctgttctc acgacgttga gtgccatcaa agctcagacc gccaggtata atagattcca 
29821 tcttacctta aactggatat gtattagtaa atttcccgtt gaattttttt tttaaaatac 
29881 attgtttttc tagcacgcaa cggttgcata gtacgccgcc atatgccaaa tatgtaactt 
29941 gaaagaattt acttgtcttt ttcttgcaaa tggagaagaa aataagatga caaaagaaaa 
30OQi..acaagtaaaa ctacttcacg ctatagttta tatacagttc acatatattt tatattctct 
30061 tacgttcatt t ttgaaatat- t ttaaataaa aatttgaatt ttactactcc atagtatagg 
30121 atgtcataac agtgacataa ttatttaatc aaccaatatt ttagtgaacc taataaaggc 
30181 atataccatt tacttattta gacactgaat ggtcaaaacc atactacgga atttagaatg 
30241 gtatttgtaa aagcactttt aatttaccaa ataacagcaa ggcaaacact taaattttcc 
30301 actaagtaat ggaatacaga tcagaacaca cagaaaacaa tagtaaatta ttttgttcag 
30361 ttacactgat tatttttaca tattactaaa atatgttata aatacagttc tcatagtctc 
30421 atactggtta ttttctttat ttctatttta ttttaaatat gaaaattata ttacctattc 
30481 ttactaatat ataaatgtta taaaattact atatattttt tttaaaaaaa tatttcttta 
30541 taatataata tactttggca atagaatttt gtcatataac cataattaat ttcactgaaa 
30601 gaattagata gaagagagta tttacggaaa gaacccgata gtttggatat ttatataatc 
30661 ggcgcatata ttttgtcatt taccaaccga ctcctttttc cctctcatct gatctatatt 
30721 taaaaagctt accccattcg aaagtataaa cgtaaaagga atcaaaattg taaacgacta 
30781 aacacctggt ctttcctccc ccaatggcca cgaagctatt gtcgctcaca tgcatacgca 
3 0841 aagagagatt cagcggccgt taccctcctg tgctgcggaa acatctaaac agcaggccac 
30901 gcggcggcgg agatggatca ccgtcggaga cggcggtgct taagatcgac gaggaggagg 
3 0961 cggtctccga tgcggttttc cgcgtcacgg ggatgacatg ctccgcgtgc gctggatcta 
31021 tcgagaaaga gatcaagcgt cttcccggga ttcacgaagc ggtcatcgac gctctcaaca 
31081 accgggctca aatccagttc taccctacct ctgtcgatgt aagttttttt cttttttttt 
31141 tttgattgga aattcaatat ctgcttcttt ttattgttat atcggaatcg atcaagttgc 
31201 gtatctcttg gttgatgcac ttcctgaacc tacatgataa cgtgttttta tttactgtgg 
.31261 taaattagga taataggttt cgttatctat tttaatctgg attgttgtac tgattgggta 
31321 ggtggagacg attcgtgaga ctattgaaga tgctggattt gaagcatcac tgatagaaaa 
313 81 cgaggcgaat gagaggtcca gacaagtatg caggataaga ataaacggta tgacttgtac 
31441 ctcatgttct tcatccattg aaagcgtact gcaatcactt tacggtgtac aaagagctca 
31501 tgttgcctta gcaattgaag aagctgaagt tcattatgat cccacgctcc tgagctacga 
31561 taaactactg gaagagatag ataatgctgg atttgaagcg attcttataa gcacaggcga 
31621 agatgtgagc aagattgatt tgaagatcga tggcgagttt actgatgaag caatgacgat 
31681 gattgaaaga tcgcttgaag cacttcctgg ggttcagagt gttgagatca gccatggaag 
31741 tgataagata tctgtgttgt acaaacctga tgtgacgggg ccgaggaact tcattcgtgt 
31801 gatagagtct actgtctttg gtcatagtgg tcacatcaag gcaacagtat tctccgaggg ~ 
31861 aggggtgggc agagagtctc aaaagcaaga agagatcaag cagtactata agtcgtttct 
31921 ctggagtttg gtttttacgg taccagtgtt tttgacagcc atggtcttta tgtatatccc 
31981 tggaattaaa catttgctaa tgtttaaggt catcaatatg ctcaccgttg gagaaatcat 
32041 aaggtgggtt ttggctacac ctgtacagtt tatcatcggc tggagattct atgttggctc 
32101 ttacaaggct ttacgccgag gatcagctaa catggatgtt ctgattgctc tgggaacaaa 
32161 tgcagcttat ttctattcgt tatacacagt gttgagagct gcaacatctc ctgatttcaa 
32221 gggagtagat ttctttgaga ctagtgccat gctcatttcg tttatcatac taggaaagta 
32281 tctggagata atggcaaaag gcaaaacatc tcaagcgatc gcaaagctta tgaacttggc 
32341 accagacact gcgatattgt tgactgtgga cgaggaaggg aatgtgactg gtgaagaaga 
32401 gattgatggc cgattgatac agaagaacga cgtgatcaag atcgttcctg gtgctaaagt 
32461 agcttccgat ggttatgtta tatggggaca aagtcatgtg aatgaaagta tgataactgg 
32521 agaggcaagg ccagtggcaa agagaaaggg tgatactgtt ataggaggca cactgaacga 
32581 gaacggtgtt ctgcatatta aggtgacaag ggttggttca gagagtgctc ttgcacagat 
32641 tgttcgactt gttgaatccg cccagctagc caaagctcca gttcagaagt tggctgatcg 
32701 gatatccaag ttctttgttc ctctggtgag tagtattctt gtcggttgaa attcaagttt 
32761 agtcttttaa aggcacaact acttcttaaa cttacgaatt ataatattgc aggtaatttt 
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32821 cctatcgttc tcaacttggc ttgcctggtt cttagctgga aaactgcatt ggtaccctga 
32881 atcatggata ccttcttcaa tggatagctt tgagctagct cttcagtttg gaatctctgt 
32941 catggtcata gcttgtccat gtgctcttgg gctggctact ccaaccgctg ttatggttgg 
33001 aactggggtt ggtgcatccc aaggtgtgct gataaagggt ggtcaagctc tagaaagagc 
33061 acacaaggtt gagcctcttg cattgtaact tattatatta tatatgattg ttctgtacca 
33121 ttttgatttt ttttcagatg atgaaaccta acagatcgaa tgaatgtgta ggtaaattgc 
33181 attgtatttg acaagacagg aactctcacg atggggaaac ccgttgttgt taaaaccaaa 
33241 ctcctgaaaa acatggtact tcgagaattc tatgaacttg tggctgcaac tgaggtaatc 
33301 tcttgtaact atcaaacaca tatacaaact cgtggcatgc ctcaaaagtt tgatctctac 
33361 gatgtcacta tctttttatc aggtaaacag cgagcatcca ttggcaaagg ccattgttga 
33421 atatgcaaag aaattcagag atgacgaaga gaaccctacg tggcctgaag cccgtgattt 
33481 tgtgtctatc actggaaccg gagtgagagc gactgttaaa ggaagagaga ttatggtggg 
33541 aaacaagagc ctcatgtctg gttacaaagt tactattaca gccgatgctg aggagttgct 
33601 agcagaagct gaagagatgg cccagacagg aattctcgtg tctatagaca atgagttaac 
33~66l tggagttcta gctgtttcgg atcctgtaaa. accgagtgct cgagaagcca tctcaattct 
33721 aaaatccatg aatatcaaaa gcatcatggt aacaggtgac aactggggaa ctgcaaactc 
33781 cattgctaga gaagtcggta tcgactctgt tatcgcagaa gctaaacccg agcagaaagc 
33841 agagaaaatc aaggaactac aggtatgtag tacctccaga gtaaatatct tttaaaatat 
33 901 aagattgaaa atgatagaaa caatgattct gtaggctgcg ggtcatgttg tggcgatggt 

33 961 tggagacgga gtcaatgact cacctgctct cgtggcagcg gatgtaggaa tggccattgg 
34021 tgcaggaacc gacattgcaa tagaagcagc tgatattgtc ctgatgaaaa gcaacttgga 
34081 agatgtgatc acagccattg atctatcgag gaaaacgttc tcaaggatcc gtctcaacta 
34141 cgtatgggct ctcgggtata acctcatggg cataccgatc gctgcggggg tgcttttccc 
34201 gtctactcgg ttcaggttgc ctccgtggat tgcaggtgct gcaatggctg cttcttctgt 
34261 cagtgttgtg tgttgctctc tcttgttgaa gaactacaaa cgtcctaaga ggcttgatag 
34321 cctggcgatc cgtgaggttc aagtggagcg ggtttagaaa accaaactaa accgatccga 
34381 acagattact tggttgtgtg atttgatgaa ctgtatgatt tgttacgtta gcataataag 
34441 aaatttcaca tgattggttc aatggttctc aggatggttc ctgataatat ttacctagca 
34501 caaaagatga tgctttatcg tattcttctt ctagaatact ggaaaagaaa tgtcacaagc 
34561 aaactaagaa tgcatgccgt tcaaatctag aaaaacgctc cacactgaca agggagagta 
34621 accttatatc atttgagaat aaaaaaatac ataaatctga tttgaccaaa aaaaaaaaaa 

34 681 aatacataaa tctgacttgt gggttatgac ggcagatgct gagaaatggt ccgagatttg 
34741 tttctttact cgtgtccttt ggatgcattt tatttgaaga tgatgcttta gttttacgca 
34801 tcaggcgcaa taaagctgaa ataataagca accgcaacgg cccatcatta aaagccactc 
34861 gcattaacag cgcgtgagga cggataggcc agagcagaga ctgcattttt catgcctttt 
34921 agctttctct cttttgtcct cctagcagct gactcttctc tctcatttct cgtgcttctc 
34981 aaattctact tctatggcga cgaaagaccc agaatctgtt tacgatataa gcatcgaggt 
35041 gacaattgct taacttcaat caacactctt taataataga aaaaaatgtt tgtgtgtttt 
35101 tgctaaaagg aatgatcttt ggtttaggtt tttttttagt gtgtaaaaaa gagatccatg 
35161 aatggttagc ttcttcttgt ttttctcaag aaagaaagac tattactttg aacttaagat 
35221 gtattaccct cttttttttt gttgttttaa agtgcaagtc tttatctaaa taatcgctat 
35281 atctcttctt cttgtttttc ttggaacagg atgcaaatgg aaacagctta gaactcagtc 
35341 aatacaaaga caaagttctt ctaattgtca atgtcgcttc caaatggtta agctttcttt 
35401 tacacacaca cacctcttag ctataaactt ccattattgt aattttgatg ctaaacgttt 
35461 gtttctgatg gtctcagtgg gatgacgaac tctaactaca ctgaattgaa tgagctttac 
35521 aacaagtaca aagacaaagg taaaaaactg gtttcaacca aaaatcactc aaagtctctt 
35581 tgtttcttcc cagctgatta gcttttttca aagacttgat gttttgtttc tgctgcaagg 
35641 tctggagatt ctagcatttc cttgtaacca atttggtgaa gaggaacctg gaactactga 
35701 ccaaattaca gagtttgtct gtacccgctt caaatctgag ttccccattt tcaacaaggt 
35761 aaagaacatg aattggttgc aagttgtatt gatgttgaat ctttctccgt tgtgtgtttt 
35821 tagattgaag tgaacggaga gaatgcttcc cctctttata agttcttgaa aaaaggcaaa 
35881 tggggaatct tcggggacga gattcagtgg aactttgcca agtttcttgt tgacaagaat 
35941 ggtcaagctg ttgaacgtta ctacccaact acttctcctc ttacactaga ggtaatgaaa 
36001 ccgtctcttt ccgtctctga agtcctgaftt cctgaacact ttttttactc tctttgtgct 
36061 tatgttatgt cctcttgttt ttttctgcag catgacatca agaagcttct gaacctctca 
36121 tgaatggtgg tgttatccga ttcactctgt tgataactct actttggtcg tgactcgtga 
36181 acttgtcatg taataaagca aaatcacatt tgactttggt tccaatttgt tctgtttacg 
36241 tttcacaaat ccaaaccggt tgcactcagt gactcacggt taatcaagtt tagtaaaaca 
36301 agcattacct catcataata cacaaaaaat gttttagtaa atcaacttta gtaaaataaa 
36361 atacttatta atgtgttggt ttctataagg gaggaagaac tgaataaata ctcacggtta 
36421 agacctctct actgatgaaa accaggtttc ttggtgacga ccacaaagaa agagagagag 
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36481 agctttatgg gatagctaaa gttaaacatc tccttaaata ataacctgaa gaaaggggtc 
36541 tagctagttg ctacattaac aatgttacaa ggaacacaga gatctgtctc aattctccga 
36601 gatttctggt tttaacattc tcagataaaa tctacaacaa caaagaagat atttacagca 
36661 ttcacattct gcacgtatgc tgctgctgct gctgctgctt caatagctaa gaaggtagta 
36721 aaataggaag aaagattaaa tactgtctcg ttctgaatca tccgcgagtc aagtcaatat 
36781 cagtgagtag gtttcttccg ctagtagagt caagtgatct gcctgagact gagactgaac 
36841 cttcccctga accgggccat ccactcattc cctgacctga gaagttctgt ggcgtctcct 
36901 gacccacctg tgcttcttct tcttcttctt cttcttgtgg attagtctcc atttcgctat 
36961 ccttgtgctc ttcctttggt ttattttctt gtttgatagt cttgttactg ttattaactc 
37021 ctccgtatat aaaactacaa gccacaacct tcaaacaaca acaaaacatg ttttttttat 
37081 ttctgttaag agacattaca aacaagaaac caaactcata ataccacatg aaaccaataa 
37141 agaagaggaa gaattgagaa acctgaacga ggctggctgc aataagcatt ccaatcccac 
37201 cgccaataac atgaccttca ggaccagaaa gggagacact caaaccgcct gttcggcttt 
37261 tggatccacc ttcttcattc accaaataag atccccctag acttagtatc tcaaaacgtc 
37321 cctaacaaca acaacacaag gtgattgata taaagaactt cattagtttc ctcttgtttt 
37381 gagtttcttc agttaacaga atgaaacgat ttttgaaaaa cctcatatgt caaagaagac 
37441 tctgttgtag cgggttgacg tagagtgact gaagaaactg ttccggtgcc tgacattata 
37501 caaagagctc gaggtctttg ctgtgagaat gacatgactt ttgaaacgat atcctgcgag 
37561 aaaaataaaa gaaaacagat acagaatctt aatatattaa atccataaat atattattag 
37621 attgaaatga ttttttattt ttttctggag agcttacttc tcctgctcca acactgatga 
37681 catgaggtgc aaaagcaagt ccagctgaag tgttcatcca ctcacctaat gtaatcaaca 
37741 gaggtaaaag attgctctta aattaaaatt tatattgaag actaatcaca tttcaatgac 
37801 ccattactta gctatacaca gattataaca cacatggatc acttaaaatt actttttcca 
37861 cagcttaata cctaacaagc aacgttctgt aggaattaca acatatccaa atcaaaccaa 
37921 actaatttgt ttcagcttaa gatttttttt cactaattac taatctaaac aaaagtagag 
37981 attagaccac acacttggat cattaaatta tcagtaacag cagagctaaa ccaacatagt 
38041 ctgtcttaat ttggccacac aatcatctat ataccttcta agcatcactg gactttcaca 
38101 taaccataca tctaaactca agttcttcac agtgtaacat tgaaaaaaaa cacttgagat 
38161 ctcaccaaga ttagctaaac gttgcttcct tccagtacca ggaggtcgac ctctggctcg 
38221 tttaggagca tttggatcag tcactgcaga agagtcctta gctgctttac tactaccacc 
38281 agaagaacaa ggcacaggag aaagccctaa agagacttgt ccatcaggag gagcgtactt 
38341 cctcggccgt ccacgcttct tcttaaccaa cggctgctcc atctgtggcg gctgcagcgg 
38401 ctgcacagaa gcagaggaag ccatccccat gtgaatgttg tgtccgaaat cagagtgtct 
38461 gtggtcggcc atggagaaat gaggacccgg gttgttagct tgaggatgat gaacgttagg 
38521 gttcggcata ggtctcattc ccggcggcgg cggcgcgtgg agaccactcg cgacctggga 
38581 aggggagaga ttggtgaagg ctcctctttg gaggtagtag tgaggaaatg acattgcttc 
38641 tctcccgtcc atgcatttaa atgccttaaa ttacaagtct tttcaaatgg gttttgagcc 
38701 ttttagatga aacattgaga ctttatgaag aagagagtag gaactaccca gaaatagtaa 
38761 accctaattc cccaaattag taaagagaac aacaatggcg gatttttttt tttttgtgaa 
38821 acctataaag aaggaagcta aaaatcttga gctgttacta gtactgttag tctaggagag 
38881 aaaggaaatc tcaagagaag agaaaagaga gagagatttg atttcgaaat ctagggtttc 
38941 cgagatgata aaaacgcaaa tgtacatcag aggagaccag ggaagatgac agaagagaag 
39001 agaacagaaa agagagaaga aagagagaga ggtttataca gtagtttttg atttttttta 
39061 aataattgtg aagagagaga atgaaatttt aggccttgtc tgaaagttag aagactgaag 
39121 aagaagcgag gaaggagaga gagtgacgga ctgacgacca ctattttttt tcctttatat 
39181 attttaaaat tattttttat tcaaacaaaa ctattaaacg gttttaataa tattaaattg 
39241 acttgtttta agaatagttt tattcaaaag cttttgtact ttccgttttt tatagttgaa 
39301 tttaatgtct ggaataatgc atatacctct cactaaaaat gtaatttagt cagatatata 
39361 tattaatgcc tttcatgtat aaattatagt atgcttttta acttttttta ttctcatgaa 
39421 aatggtatgc tcatctcagg ttaacaatat ttacacaatt aaaataaaag taattggcat 
39481 gaaaaataaa taataattta caaatttgca aaatagaact ttagtggaat tgaggggtag 
3 9541 atctagtccg tttggtgaca agttgacgtg aacaccctta tcaatctgtt ggctgagaaa 
39601 tagaaaccaa gagttgaatt atgatcccat cagtataatc acaaattatc cacataaaag 
39661 ctaaaagaag actcatccac agtcatgact atactctatc aaacataatc ataaactcat 
39721 caaaaaatta gaaactataa aaataaatac caactaacat ataaaattat ctggataagt 
39781 tatatataga atgatataaa acatatgtat cacgatgcag tctaccctat tcggatatat 
39841 aagtaataca gtattatact aattatctgg ttatttttat ggttgttggt tatttaagtt 
39901 agtggtcagt cagattttga atgaatagtg aacttgatac ttacctttat attataaatc 
39961 ataaacttta aactcaaaag ataaacttca aatcttaaat ctaatagata aaccataaac 
40021 caaatgctaa acatcaatca taaacataaa tgataaacct tgaacctaaa agatcaagct 
40081 taaattttgg ataaacccta aattcaaatt aatagagtaa ccattaatcc tacaccttaa 
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40141 acagcaacat gcaaacttta tctttagagt aatataaatt caaaacaata aatcatttta 
40201 tataatacta agttaactat atggtcaaca ttgctttaga cttgttaaaa acatctatat 
40261 tttactaatt agatttttcg gtatcaataa attaagctgt gtataccagt tattcggata 
40321 tcattattat caacaagtat ttacatcata aacatatagc cgaaaattac attataatct 
40381 ggaatcataa aattatttaa gggaatgcta tataatcttt tcatgagata caaaaatgaa 
40441 gtttctattt caatcattaa atatatatat atatatatat atatatatat atatatatat 
40501 gtatctcaag atttttccgg tcatcgtctt caacacttga tttctactag gcatgggtat 
40561 tcggggtcct aatcgggttt cggttttatc cattcagatt tcggtttttc gggtttatca 
40621 aaaccagccc tattcggatt atttgaaagt tcggttcggg accggttcgg gttttatcgg 
40681 gttcgggtcg gggttagtaa atcttcaaag aaccggtata atccaatgta ctttcggaag 
40741 ttcgggtccc aatcgattat tcagtttaaa aatatctgat ttgtacctac tttgtaacta 
40801 aaacataagt aaaatcgatt ttttgggttt caaataccta atttgaacat ttttgtaacc 
40861 aaaaacataa gtataatcga tttaaaaata agaaatgaac atcaaccatg atcatttaaa 
40921 atcaaatgaa aatgaaatat agttattgat aaaaagaaaa gcaaataaat gaaaacataa 
40981 aatgaaaacc gcgttctcat gaaatgtaaa acattgttta atgaaaacaa aatctaaatc 
41041 taaatattta aaaattcaac agccacattt aaceatcaac cttcatgtaa tagaatcatc 
41101 aacattcatg taatagataa gtattttata tgttcagtat atcttaatgt attttgaata 
41161 catattagga attgagatca tgtttgatac aagatatttc tagggttttg aatgtttcgg 
41221 gttctatcgg atatccattt agtttcgggt tcggtacgga taatacccat aacccgaaat 
41281 accatgaaac aagatccatt cggtatttat gtcgggttcg gatcggttcg gattcatttt 
41341 tatcggatcg ggttcggttc gaattttcgg attcggttta tttgcccagc cctaatttct 
41401 accattctaa actccaagaa aatgttctat attgttcttt ttgttcatgt caagagatct 
41461 attaatagct gtgctgactg ctgagtaaat cgtatgtatc attaattgat tggagtttgt 
41521 tctattcatg tctttccggc atgaacaaca aaatattaca caatcttaat taagtaaaat 
41581 ttttagtcga aataattact ttaatgtatt aacaactagt ttttagtggc cagcaagcac 
41641 atctctaata -tattatttca tttcttactc taaaacggtg taaaccaaaa atagaattga 
41701 atttattcca atgtattact ttatttttta ctttaaaaat gaatactctc aaaataattc 
41761 catttttatt taattaataa tttttagtaa taccctcact tataaaaatt atcaattaac 
41821 tccaattatt ttatgtttac aaaaatttca taaaatataa tttatttaac ttaaatattt 
41881 attattaaaa tacaataatc ataaatatat ataagatata tagacattct ggttcaataa 
41941 tgaacattag aatatatttt ccacaaatgg ttaactaata cattttaaaa tgaataacaa 
42001 gttttttatc tttgattttc ctaaatcagg gaaaaaatat gaaatcggtc atttttatcg 
42061 ttcggacaat acttcattga ttttagcggt tccataaatg atttaggaga ttatacttat 
42121 tttatttatt ttatattttt atgtttagtt ttggtatttt tattagttta tacatcttta 
42181 tttaaattat taaataattt cttatgaaaa taatattttt catgttattg tatattttta 
42241 aatattgaag tattaaataa aaataaggaa aatatgaaat aaaaatattt aaaatttaaa 
42301 agactatttt gtaaataaat aaaatcaact ctaaatagaa taatgaataa aattactcta 
42361 tatatagata gagtaacctt tgaaaattga gtgagattta agtatatttt atttaaaaaa 
42421 gatgtttaaa atagaaaatg gaataggatc aaagatgtcc cctgatacat tcgtgtgttt 
42481 ttttcgtcaa acacctagta tgctcttatt gcaagattta aaaacagaaa acaaagcgac 
42541 ttttattttg gttcttgatt aaaaagaggg tgccatgaga gagatgtctt gttgactaaa 
42601 gagaggagag ggaagcggac ctacttgaaa agagccaaga agagagagga tatctctctt 
42661 tctctctcac atgtctgaat cactttgcta tttctttctt tgcttcggac agtgagcctc 
42721 ctttcctaac tttatttgtt ttcacttttt atattcgccc ttttcattta attttttatt 
42781 aaaagtcttt ttttatttct ctcaatattt cttttgaaaa tgattcaaga aaatatttat 
42841 actatatgag ttccacactt ccacctaaat tgaaagcttg tatgatgcat ttattgcatt 
42901 taatatccgg tggctaaaga gttccgacca ataaaatcca atttgtcaac attttgtaat 
42961 ttgtactata tatttatata gtcttttttt ttgttttttt atatagtctt tttagatttt 
43021 gttttgctaa ttttccatcc tctatttata tcctctaatt tctttttttt tccattttag 
43081 aattaatatg cggtccttaa tctttgcact taacttgttt tgtcaacaat ccctacagtc 
43141 agttatggca aaattatggt gtcaaattac acggcttcaa cgcttattta atttggcttt 
43201 ttgacaatga aataaaataa atttgacttt ccattctcac ctttttgcgt aactttcacc 
43261 taaacttagt aacttgtcca ttttataaac taccgatttt gatgaaaaga tcgcataa.at 
43321 atatttgagg ccgaacacta ctgaacttgc caaaaaaatt atatattctc acataaatgg 
43381 ccataaacat acttaaatgt gtccatataa catttttgct agttcaagag agcaaaacta 
43441 tatcagaaag tatttctcag aaaaaaaaat tatagaagac tgtgaatcac aataaatgta 
43501 gatttactaa aagtggaaga agagtgttca aagaggtttt cattttgagg atggtgtctg 
43561 ccaaaggtac aacaaggctt tatttctaag aaattgccgt agttaataat gtgtttcaca 
43621 atacgtatat atacatagag agagagagat tatgtactat tataatagtg ggtctgtgaa 
43681 acatgacgtc attaatagtg tgtctttctt gtaaggaaag tgatctcaac tgttgctttt 
43741 ttattttcct tctgatgtat tttgtgggga taagagagat atgattctct tcacaattct 
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43801 tttgtggtcc ggatatttct ggtccagtag tttcaaattt ctactagctg ctggtagttt 
43861 ggtgttgctc tagttttttt ctgaggtttg aaatgtgaaa caaacaacaa ccaaatcatg 
43921 atacattgaa ctgaagtttg tatatttatt ctttgtgttc actcttcttt ttttgcattt 
43981 tacaaataaa actgttgaaa -cttaaaacta gtattccaac gatctttcaa ttaacggttt 
44041 agtaacaaaa atcttgtcac gacaacaaaa gactttacat gacataaaag tccataaact 
44101 gattattacc caaaaaaaca ggctagcaag atatttgatt tcgaaacatt aaatatgtag 
44161 tatatttggt tcgactggtg cgtacattag atttaatcgt tatattcaag ttaatctggt 
44221 aaattttata gtctaaaagc ttttatgttc tccattccaa tggaaacatt acaaaaaaag 
44281 aattgacttg agaaaaacta tttaaaattg aatgaataaa ttaaattgat ttatatttca 
44341 gtgtttattt tggttctcga gtgaaaaacc agggaaggat atgaacctga agacaccgtt 
44401 tagctaccct tttgatgcgt ttacggcttt tccactgcaa atcaccaact gctgcaccgg 
44461 tcactaccct ccttgcccgt ctatggtcaa gtgacggtgt ctttggctcc attgatctgt 
44521 taacatagat caatcaaaca aatggattct gagaaatggg aagaagcgat caggtaaagt 
44581 atctcattct ctttttttta tacgtttcca tggcaggact taggtttatg ccaataataa 
44641 actccacaga acccacatga aaagtagctt tgatggtgac ttaaactacc gaatgaagca 
44701 aatgtataga ggctacaaga gagtacgcac gaataaaagt ttgtgccaat gtttttggat 
44761 ggtaaagaaa caaacctgga gggatcgttt tttggagagg attctgccgg aggacagcaa 
44821 gcgaagcaga cataagcctc aggccgccag tgaagtttaa cacaataggt gtgataccat 
44881 tcgccacatt gactacaaga aaccatgaat cttgagttgt atggcttcag gcaaacgcag 
44941 tgcaacatac ttcgagctcg taatgcctgc atcaaatcaa accataataa tacaaacagt 
45001 tttagataaa cagtaaaggt tgtgttgtgc ttgttaacgc tcatatcttg ctaacctgaa 
45061 gttcttcctc tgcaatgacg ggcaaatttt caccctcaga gataagctcg aatacatctt 
45121 ccaaggcaag agcacccgag tctgtcacca cctggagaca aaaccaataa ctctcgttaa 
45181 gcaattccaa agtggtagca agcaatataa tgagaaaccg tgggtgtggg ttggtcatgt 
45241 acttttctag ctcgttttgc ccactcttgt ccagtgtctt ttagctccac aagtctcttg 
45301 cctaagtgat actcttcagg taatattttc agtgtttgtc cctgtcaaga agcacataac 
45361 ccaaactcat caaccagatc attgtggtgc tctttattga aataaggtat ctctcactaa 
45421 gcgcaggaag ttgatattat tatcagaaca taccgcttct aagtgctgct gaattttctg 
45481 taacgagggc tttcctagat cttctggtat ctgcaatttt aagacgagct taaacgatca 
45541 gaaaggggtt tgtaaccaca attacatctt gaacacaagt atcaaaacca attaattaaa 
45601 attcaatacc ctattttgta cctcagggga aggtgaacta gagcctaaca agatggtctt 
45661 ataacaccat cctagtttta aacttgccgt ttgaagtttg ggaagttctt ccaaattaag 
45721 accaagggac agaccagatg ttctggcgca gtccactctg gttatcaagt ccataatctt 
45781 cggaagaatc gtgctatgct tgccatcatc tgttttcgca gtgtccaaca aatcttcagt 
45841 ttcttgcaag agcgggtgat tatcacactg ccacctctcg caactaagca gtaatgtttc 
45901 aagaatcctt ggctcttcaa gt tgaacacc aagcgattta gactgagcaa ccaggtcctg 
45961 cgaaaatcaa aacattgaac actctataat cacatgttac agaaaacaga aattccacag 
46021 gaggcagaca aacctttaac acaggaagtt caagcagaga acacggagat gaggcagtag 
46081 aggaaacggc agataaaaag ggctcagaat tctgaagcca agtttcagcc aacgaaatgg 
46141 tgttctctat gcctttcaga gagggtaaaa cggcaccaat gttgacagac atcctacgaa 
46201 catcatcttt ttgatgaaca aaagacttga gaaatactaa gcatgggctg ggttaagaaa 
46261 acatactgta cgagttcttt aaggtcggac atctgagttt cactggcaag aatactgctt 
46321 gctctttcct cccaaaacct cgctgtagac aaagttccgg cgatctcaac aaacagcttc 
46381 tcctcttcga tttgtagtct ggaaatagaa agtcttggca tgaaattata atggagtagg 
46441 atgcatatta catattctaa aatcaaatat tagtggacga cgtacacaac agcttccgag 
46501 agcagctgct caatgaaatc cagagacgat cttgcagcat aaacctacat* cgatatgaca 
46561 agctgcatca aagatatgaa ccacagaaag agcactcgga acatgaaacc taatgctatt 
46621 tcattatata aaacaaattt ccatgcattt tacattcaac acaaaatgaa agaatgcgaa 
46681 ctatcacaaa tagtatttta aatatcttga taattgctgg agaccaataa gaggcaaatt 
46741 caaaaataat tagagaaaac atttaccgtt tgtgcttttt cacgacaaga tgccttcttc 
46801 aattcaacct caacaagagg gagcccttca actatttcag atactgtggt tagtgttaag 



.till taaaaagagt gtcaaggatt agaagtaaga ccttgaatgc ctaaggatgc tccatcctgg 
46981 agaagggaac tcaggtcact gattagcttt cgttggtcct tgccttcacg aacatcagcc 
47041 attgtatcat caaagcgtga aatccaagac aaagcatcaa catggtactg cctcaggata 
47101 ttcagctcac gaatgttaat gctacaaagg ccatcgaatt catgaaccaa atcttctaca 
J 7 " 1 ttctgtaaac aaaagaaacg agtgacttca gaatgattcg acaaaagtgc atgttgttat 
:Z*Z} tcacacttaa agatacagaa caaaagtaca gccaaccttg agagatataa aaccactcaa 
Zll.l aacttt 9 cta catttatcac gggctagttc tgatttcttt agcaaatcca ggatcccttc 
:ZAl t 9 tttct 9ta agcgtcacct gaagctccaa tatctattaa gggaaaagaa cgattagaac 
47401 acatgaacaa gtgacatagt gatcattttt agaataatga tgataatagt ctacctctga 
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47461 ctttagcttg aaaagttcat ccaactcaat tccaggaggt tttgaagcgg caagatagca 
47521 tctcgccctt ttagctaaca tctgccgacc catgaagaaa ataatttatc aaattcaaga 
47581 aagggcaaga aatgaaagcg aacctggaaa atcgaaacca ctagaatagg agtatgctcg 
47641 taaactttga aaggagaaaa gaggcaaata catctagatt caatatggcg tcagtctctc 
47701 aaagaccttc gtggatatgt aggtataaga aaaatacgtc ctagataccc ttcaaaatta 
47761 gcaaggattg aagaaaatat atctcatatt taatccaata tgcaatattt atatactcct 
47821 tttgactcaa acaccccata atagaaactg cgagctagat gaaaaaatga ctataaatat 
47881 gtctgtccga accatggttt accttcgcgg aagatatttt ctttgacaag atcttgtgtt 
47941 tacttaggga gactggtgaa ctggaaactt cggaatgcaa tagctccatc tgggcgatct 
48001 acaaggaaaa acgggtaaca ctaatcagaa agcataactt gaatgatttt actggatttg 
48061 gtcagagtga actatcgtgg ggaagttttg tagtggcata ccgttgggct acttgacaga 
48121 gaacaatcga ttttctcaga caacttccta gcctcttcag caaagtccta caaagaaatg 
48181 gaaatgctca attgataaca taatgactta caacgctacc aatccaaaag gaaaactgga 
48241 acctttcagg tacatagact aacctttaat ttaagataac cagactgaaa gcaaggaacg 
48301 gggtcaactt tcagcaactc atcgatgaac- tccaagtgaa ctttctctgt _atcattgt tg" 
"48361 cctttgcctt cgacttttga aagacagtcg ccaacagctt cagcccatat" ctttgct ttg 
48421 atcaaacttt ttgcaacatc ccgtacctgt gtgacaaata gaaaaaaaaa gtttttttcc 
48481 ataatccacc caatgagata tatacaaaat ctttccaaca . gatact caaa actcacgcgg- 
48541 ttcatttctg atccacccca aagaaactgt tctgcttcct ttaaaagagt ggcatataca- 
48601 tcactggaaa agacatcctg gagaaccttt gatgctcgac gaagccaatc ttcagctggt 
48661 cgtgcctggc aaacttgcat cccctcctgc tagtaaaaga aaagacagca atattaataa 
48721 actgtaataa gattcaaggc caagcaacca aatatgatat atacaaaaat ctaggttgag 
48781 acatgtttgg acattaatat ataaagggaa ggctcctagt tttttttttt tttttttgga 
48841 actgctccta gttatttaat actagacttt atcaaatcct taaatacgga tgaagaatgg 
48901 cgatgctgcc gctcatataa agtccaatat ggaaaacaaa aaaaaatccg tattgttatt 
48961 tgcaagtaag tataaaacta cctttttgtt agaagcacct gaccgtttgg tgcctgaact 
49021 cggccgttgt gaatttttgg tttcttgagt tttgcagtca ccaaactttt caacttcttg 
49081 taccattcca tgcaactcgg caagggtata ccgatatacc aggcgcagtt ttgtaggttc 
49141 acattcacaa aggtgcttcc agtgctgaat tcaaaattat aaaaacttac agaaaatcga 
49201 gacaagaaaa tcttaagaaa aatctgccta cagtaacgca cctttagaac cctcaacagt 
49261 tctatctaca aaaaaggcta gccaagttga tccccctcta agaagttaga tatagatatt 
49321 tacctccaag catgcaaaaa cagatggcct gcagctgcag acgatagcag aaagatggag 
49381 aaactgctgg cagatgatgc atgttgtatc ctacaaatac cattataaat ggtccacacg 
49441 tcggagtatt gtcttatgaa agctaacatg aaagagtgga caagcttaac atcaaaacca 
49501 acaatatctg attgcagaaa gaaaaaacct cttcgatgcc cacagaatca gggcattcag 
49561 gtagaaacat aggagaggat ctcaaaatac cacttttcca aagctgctct ctcaaagttt 
49621 tttccttgct atatattctg agcagttctt tcttcaaata tattgatcct ttgccatcgc 
49681 agctgtttcc ctataagtaa aagacaatta aaacaatcat ctacatttgt tgcttgaagc 
49741 tagcatacac aattccacta gtaaccaatg tcaaattact agtagatact cccgtgatgc 
49801 ctcttttaat gtagctatct tcagtacccc actatcccat ttgatcaagg gtaccaatat 
49861 taagtaattc tagaacaaaa ttttctttcg taaatgcaaa agatatatag accatttaat 
49921 caaccaaaac caacaaacac atactatctc ttcaccttag ctaccacaca gagaagctct 
49981 tcatgagata tgactgcagg tttacggtac agcctataca gctccgcacc ataaccacca 
50041 taaggtagcc aatcggcagt ggcaaaattg acggcctctg cacaattcaa acctgatagg 
50101 acagcacatg aaagtttatc ttaaactgca taatgacaat ttgatgggaa cactttaact 
50161 tagattagag ttggcgtacc gaaattgaat ccagcatgaa aggattttgg aaacgtgatc 
50221 acaaagtttc caggctccta gcagtgagaa caagaacata aattatgttg gaatactttt 
50281 aggaataaag ttattattaa tcataaactt gtaaatgcac tagaaatgct aacaaccaaa 
50341 tgaactatat acttgtatac caaacctgta atactgtgta cacagggact ttattttctt 
'50401 gcaaaacagc cggactcaac atagtaacca gttgaaagag caaatctggc tgagcatcaa 
50461 agagatcagg tagggctttt cgcatgacct acagaaaaat cacgcagcac aagcacctta 
50521 acttttcttt ctgaaagtga gcaaatattg tagtatgcca gaagaaatat attccaaaga 
50581 aggaaaagga actgatactg ggccgtgaat aactagccaa agattagtta caatctggag 
50641 cacacaaata aaacaaacct tttcaaaagc actagcagca ctgcctggaa caccatacca 
50701 acattttgct tctcccctga aatgtagaat caaaaagaaa acaattaaga gacttattat 
50761 gagtgaaatg tttctttatg acaaaataag gtattaagca aggtgtcttt gcgacttcca 
50821 agaactcaaa aataacaaaa gaagaagatg acgattacag tacaggcaca taccagtggt 
50881 gataattcac agagtaaaaa caatggtcct caaaatgcca acaaaaagca gagaagagca 
50941 ttccaagata tagccaaggc actgtaacac cattgatgtt atgccgaatg gcctgaagca 
51001 tagatccttt caacttaggc atgttattga gattccaagg gctatggcaa tat'tcatccc 
51061 aaacatttgc ctcaactgat tctggtcttc tatcacctat tctaggaaaa ccactcccgt 
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51121 aaacagaagt atccaagtca ttaccataca taacttcaac gtcaccacct gacccttcta 
51181 ctatttccca aaacttcttc tcaatctgcg tcctagacac cggccctgaa ccaaaccact 
51241 tccttttggc gcgatcagca atccgcttga aatcttcaag tagcaaacat ttacccggca 
513 01 caaagccaaa agtgtcttca tcagtgttta agcaatcaag gcaataccag ttcccaggcg 
51361 ggatatgctt caacggcggc gagagacagt atatatgcca acctttatta caactatcac 
51421 acaagagcat cacttcacca tgtttgtcac ttttgcactg ctcacatgcc tgatcaacct 
51481 cttcttcttc actttccacc ttatgatttt tctcgccatt atttcttttc cttctcttcg 
51541 agctagagaa ctcagaacac cgtcgattcc ttttgtgacc ctttgcagaa ggatccgcac 
51601 tcatcatccc gtgatacttc tcaaactcat. gcaaatgctc tttatacaac tgacacaaaa 
51661 catgcttagc acacttggag atcttctcac cagagctcat aaactgataa acctcacccc 
51721 atttcttccc cttgacaacc ttatcataac ccccaaacct cttcaccgcg ttaaacacct 
51781 tacacagatc caaatcaccc ccttcaaaaa ccaccctctt cttcaccttc ttccccaaac 
51841 gctcctccac aaacctcccg tactccagct gaaacgtctt ggagttgcaa gaagcgggac 
51901 ggaactgcaa ^ccggtgaatc tcctgcgtct ttgtcggaaa cctgacagac tccaagtcca 
51961 jiaccgaaagg cggtttccag cttttcggtg gaacaatctt gcagatcccg taagcctctg 
52021" cctcgggctt gatcttgtgt atgtaatcta gagggctttt gaaCtTcgtct tcggtcgggt 
52081 aataaaccgg tccagacggg acattgaccg atcctctcag cttctgatct aaaacccttt 
52141 tctctactgc cttagctcta ccttttccca ttaccaccaa acacaatcca ctatcataac 
52201 ctatggaaca attcaatttt gatcacaaac cctagagaga ttaatccaag aagagtgagc 
52261 gtgaatccaa ttctagggtt ttagcgattt accttgtgag agagagggat agagagagac 
52321 aagtagtgca aggaatcttt ggtttgtgaa gaaagatttt gagaattttt aataaaatat 
52381 atatcgtttc tttataaagg aagattcact taattcaatt tccgaaattg ggttttctta 
52441 gtaacggata ttacggaacg ggggaggaga agggaagcca aacgcttctc agttcgtgtt 
52501 tatggaaaga aaaactttgt gccggttttg ttatcaagta ctgcatgccg tttatattga 
52561 aacaagctgt acaccgtgta gtttcgtttt actcgctata gtgggttttc tttcttggcc 
52621 ctgaatgttg aaggcggctt tgttaggccc agaagatgat taag.ctaatt agctaaacta 
52681 gttttttttc gacaataagc ttttttattt atcgtgacaa aaaaaaagct tttttattta 
52741 tttattatta ttatcatcat cttttccgac aataagctaa agggtacaaa ttcagagttc 
52801 ttcacatacg aaatcttgtt gaactatata agtgatctta aactacaaaa cttttttttg 
52861 aaaagtgact ttctaaaact atctgaattt cttaaatgaa cctttacatt ttatttacat 
52921 gtaagatttg atagaaaaca aaagtttcaa gaaaataagg aaaagttgta tgttttcaag 
52981 gtaagaagaa acaaataaaa cagtaaaaat attattgata ttgctcgcag cctcgcaggt 
53041 aaatgatgtg atggtcggtt aaaaggcgtt catactgttc aaaaaattat tggagcgtta 
53101 tgtcggtggc aattagaaac cttaaattta ccatactttt tgtatcaata aatacctttt 
53161 taattttatt aattaacacg ataattaata tcggttacga atttggttcg aacaaaatgg 
53221 tctaattaaa aaaaactaac aaaattcgct catccaccat actaccatat tccgctaaag 
53281 aaaatgatag actatcgatc ttcgttttct actattgtag attaccacga gaagatgaag 
53341 tttacctcgt ttcttcggtt tttggaacat catgtggcat catgtgtcat caccggagta 
53401 tctggtttat gatgatctcc caagcttatc gtctctactg attatgaagt caaatgtgtg 
53461 ctggtggttc taattaacac agtcaagtgt gtgttgtggt tctcattaac acagtggcaa 
53521 gtcttgagac cttgtatgga ggtgggatga tacgggtttg tgtaggtttc atgagtagat 
53581 ttgacagccg gtcgtcaatg gtcttttagt ggcttctcga tgtgattcct agcttcgtat 
53641 ctcctctatg ctgcttcctt ccccattaca tccgatagaa cgaaggtgac ggctttcttg 
53701 atcagagtgt cacttctagt ttttcatcgt tgatatgtct ctatcgacta cggctcaaat 
53761 tattgttctt cgattctcga aggcttcaag tgttgtagcg attcttgggt gcagccgtag 
53821 atcgtcatgg tgtcgctctt ggtctatgct cacatcccat cgcctcacga gttccctatt 
53881 atgtggtggt gtagcttaac ttttatatgc aggtttagag gcttaatatt aggaagtggc 
53941 gatgtttcaa tatgtttctt gcattgttca atgtggctat ggtgagtgta tagtttggac 
54001 cgatgcttct cttataataa ttgatctttg aatttcttct ttattgtcgt gtgcacttaa 
54061 aaagtggttt agattacaag attggataag atctttggga tccatctatc ccaaaaccat 
54121 catgagaagt ctggacatct gaagtaacaa tgtgaatttc atatttttgt cgttgaaggt 
54181 tgagaacagt gacggattct cgaagattat cagtccaatt taataaaaca cgtcgaataa 
54241 taagtcgaac aaagctaaag ctttcagggg atgagagtcc tttcctcgaa ttcagaagtg 
543 01 aagggcttcc ccgcggttat gagaatttct ggaccgcttt cactttctgt tcagcattag 
54361 taaaagttta atttgagaat ttttgttcgg ttcaacgctt gtaatcaaaa actttatttc 
54421 tttttgagtc ttatttataa aaatctacta tattaattta gagtgctaaa atttatctac 
54481 catttagaag tagtcatttt aatttggaca ttctctagaa tagtaggtgt ttggctactt 
54541 aaatctgttt ggttccttga atatatcaac atgtctaaca acttaattta aaccaacata 
54601 aatacacatt ataaattgtt taactaatgg gctttggaaa attattaaaa ttgtgttgct 
54661 ttacaaatta aagaccatac gattatgtaa accaatttac cttaggcctt ataaatatta 
54721 ggttttaacc aactaacatt ttatttgaaa cttaaaaata aaagactata taatatatcc 
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54781 acaaacaata ttataataaa aacacattat 
54841 tcgttttctc agtaacaatt aaaagcattg 
54901 tcacacatat taacgtttat atacagtata 
54961 tatacactaa ttttttttaa taaatattat 
55021 ttcaattatt atatcgataa attaataacc 
55081 tgtagtccca aacattatta atttatagag 
55141 tgtgtatatt ttataaccaa aaacaatatt 
55201 ttttttataa tttgtatatt ctaatctcgg 
55261 tccgcccttg aaaagggcgg gtcagtatct 
553 21 gattaatggt tggtagtagt gtgtgatagc 
553 81 cccataagtg agttgacttg gtgaggagag 
55441 ctctcttaca tatacactaa cccccaaaac 
55501 actttctggc aaagttgtct gcttttgtgt 
55561 cttgagcttt ctctctcgag ctaaacaccc 
55621 ttttttcaca tttattctct cttctcttct 
55681 gatccgtttt ctcgattctg accccctttt 
55741 tacaagaact gtgttttgtc tgatctgtta 
55801 tcaacgcggt tttgatctgg tcaaggtctc 
55861 gaggggttta tggtttgact gaaacacttg 
55921 tgtacattct gttgcctttc agagcctcaa 
55981 aataaaaaaa cttatagctc tgatcattta 
56041. gatttgtttt ttttctttct ctctcaggtg 
56101 atgtgcctca gtttcgtgaa ttctctatag 
56161 ctacagagaa tatagtatca gagcatggtg 
56221 agttggataa tcagagacgt atcgctgtca 
56281 ctcgtcagtt cctggtaact aactatcttt 
56341 tatgcaatct tggttcctga atctctttgg 
56401 tgttggtcag ttaaggaact ataggatggc 
56461 agagagactt cttgttgctg agtttatgcc 
56521 ctgtgagaat catctctctc ttctctcttt 
56581 aagacagata cttttgatat gtttcagggg 
56641 taagagtagc tttacatatt gctcaagctt 
56701 tctaccatga ccttaatgct tatagagttc 
56761 tttgtttgtt tttgcttctt tccttgttga 
56821 tctctttgtt taggactcga atccaaggct 
56881 ggatggtaag agttatagta ccaacctggc 
56941 tatatatact taaccacttg catatcagtt 
57001 tctgacataa tagtttttgt gtgtgttgtg 
57061 atgtacagtt atggaactct gttgcttgat 
57121 catgtaagcc ttatggttta agatactaaa 
57181 ttctgagaag agtggttctt atctaatttg 
57241 ctggacctca taagggacag gaacattcaa 
57301 ttttcaagtg atgacgggac tgaactggta 
57361 cctcgagaac ggcctaaccc aaaatctcta 
57421.._cttgaggtat tttcccacaa ccacttttat 
57481 gctgatgtgt tttgtgcatt ttaaaaaatc 
57541 gcataccaag cagtgcctca acaacgcctc 
57601 ctgacctaac tgccatacat gagattgttg 
57661 cagccacaga ggttcacttc cttaacacca 
57721 tgtctcattg caagtccttg gcaagactgt 
57781 ttattttaat ctgatcacca tttttggctt 
57841 cgttccagat gtggaccaac cagatgcagg 
57901 ttgctttcag gcataaagac tttgcaaatg 
57961 ttacttcttg attgtttgag gttttcaggc 
58021 tgttgttatt gacagtttat agagggtggg 
58081 agaagtctgt gtcacctgat gaatgagatg 
58141 gcccaagtga tatctcccgc ttggcatatc 
582 01 gctctaggac aagagaacga agcacacgct 
58261 aaaagaaacg ctctatgatt atgataacaa 
58321 gctcgtaggc tttgcagaat atgatcaaaa 
58381 gagccactaa acggctctca tattgtgtgc 



ttttatggga ttaaagttta tttgattttt 
tctaaatatt ttacctataa attttgaatc 
aactctttga attaatactc tataaattaa 
aactttatag tctcaacatt gaatttttgg 
tctataaatt aatttttttt atagttttgg 
gtttcactgt acaagtttta atttaaaaaa 
attcaaaaac tatatatata aaaactattg 
taacaattag aaaagtttgt ttaaataatt 
agttttaatt taaaagaaaa gaaacagaaa 
aaaaatgagt tgttcttagt tttcttagga 
gagaggcctg ctgcctctgc tttgctgttc 
acacgaaaat ataaagtttt ttaattttta 
tcccacaaaa aggatagaaa gttttaggtt 
acacgagcgc ctcgattcct catttccaac 
ttcctctctg ctttactttt aatttccacc 
ttcgcatcgt tttgttaagt tgaagaacat 
atttcttttt cctaagctct gaaaagaaat 
gattgagata aaaatcagca gttttttagg 
attaaaatgg gttgtgaagt ttccaagcta 
gaatccaatc gtggactcac cagctctggt 
gtttttaccc tttttgttgt tttaatcgtg 
gtgatgatag gataggtggt gaagggaacg 
agacgctaag gaacgctacg tcagggtttt 
agaaagctcc caatgttgtc tacaaaggga 
agaggtttaa caggaaagct tggcctgatt 
gtttgcttaa. taataaagct cttaccttta 
tttcttttga caataggagg aagctaaagc 
taatctgctt ggatgttgtt atgaaggtga 
taatgaaact ttggctaagc atcttttcca 
acatgaagct tgtcttgtgt ctcagtatgt 
agtcacaacc gatgaagtgg gcaatgagac 
tggagtactg tacaggcaaa gggcgtgcac 
tctttgatga tgtaagttaa aggttatatg 
tgtgtgcatt ataaaattca taatatggtt 
ttcttgcttt ggtctgatga aaaatagtag 
tttcactcct cctgagtatc tcagaacagg 
tgctttattt gtactgtggc taagtttttt 
tctgaaggtc gcgtgacacc agaaagtgtg 
cttctcagtg gaaaacacat tcctccaagc 
tcatccactc tgtaccaagt accatatcat 
aactttactt gtactgtgta ttcttaggcg 
atgttgatag actcatgttt ggagggtcaa 
cggttagctt ctagatgctt acagtatgag 
gtcactgcaa tgatccctct tcagaaggat 
tttatctttc acaagttctt atatctattg 
atgtaaagac tccttcacat caactaatgg 
tttcaccact cggagaagca tgcctaagaa 
aaaaacttgg ttataaagat gatgagggag 
accttttttt ttcttaatca ttctcgtaat 
aatccacagt gcattgagct acctagattc 
tgtttgttat ctgcaaaaat gttcagcttt 
actcgctgaa cttcaagaaa aagggtgatg 
ctgctgaatg ttattctcag gttagaacct 
aaaccatata cagtttctga tccaaaaaag 
acaatggttt caccaactgt ttatgcaagg 
cctcaagagg cgttgaatga tgcaatgcaa 
gcatcttatc ttcaagctgt agctctcaca 
gctcttaaag acggatcaat gctcgaaagc 
aaaaaaaaga ggaaaacaag agaaaaaaca 
cccaccacca tcttcctcat aagtagatag 
atcctcatca ctggtcagtt tcgtagttcc 
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58441 ctcttttttt tttcaaatac tggatgatgc attcatttgc aaattggact ctcttttttt 
58501 ttttgtgtgt tttctgttac tggcatgaac atttccctat tatacagaag aaaaagttgc 
58561 ctactttgat tatggactgg gaatggtgtg attgtggatt ttctttggtg ggttccgtgc 
58621 tttgtaatca atgttttgta ataacaaata acatgtatgt aaactttgta ttttggcttt 
58681 ttgatgatct tttgaatttt gcatatcaga gatgttcttg tttatgtgct aaacttttct 
58741 gtcttgagat ctaaatatct catacataaa aaagggccca atagggttca aattcagtac 
58801 gaatgctcta ccactgagct atggacacgt ggatgcatag tttgcatttt ttttttgtta 
58861 atgataaata cattaatctt actcgagtag catgtagaat agctgtgtgc aggagtccat 
58921 gctcttgctt acaagaaagg atactaggga aagtaaaaaa ctaagtggtc ttctgtagct 
58981 taaatctata aaaacacatc acacatgttc ttttgttatt ttatctctaa ttgaaaatat 
59041 atttggaaga atgtacagaa aggttatttt tttatttaat ttactggagt ttcagaggct 
59101 cctctggtca ggatattgtt tctctccttt tgcttttagt tgggggtact gattgcatct 
59161 gctttttttg gtactactct tcaggatcac tactctgaaa ccatcaattg gtatttgatt 
59221 gtgtctttta gcacaccctt tctatgattg gcttagaatg caagtgaaac caaagctttg 
59281 tctttttaaa aatatttttt tgagacaact tttgtccgtt gctttcatta ccttcacaac 
59341 tacttgattc ctatgtaact tggaccttta gatctgcaag gagcgtagct gaatcttgac 
59401 ttctctatca agatttgcga ttttaacgtc ttctctctgt aaaagtttaa tgaacagtga 
59461 tagctcttcc atcatttttg aaggagtgtc tggtaagatc aactcacttt tcttt ttatc 
59521 tgtttgccac attctcttct cttctcttat gtacaccatt atgatcaaag gctcctagtt 
59581 ttagtcatgg aactggcatt acaacaatac taatgccttt agaatcaaat tctctgcaga 
59641 agttttggcc ttgtctgatg gttatataga tttgttgatt cctacaaaag cagcaactat 
59701 ggcatcgcag tgttctgaaa acgttattcc tactctacgt gatttcttca actcccaaaa 
59761 gccaggggaa gaggatgagt ttatggttca tcaggtcctc aactcctctt ccaagaacat 
59821 tccaccgaga cctacaaaac taggaaacaa ggcaaaagag agacacggtc tcttcaatca 
59881 gggaagaata agaagcagaa gaaacatatc agatgctgaa aagttctctg tggagcaata 
59941 ctcttctagt ggtttctttg gggtacggtt caacacaaac ggaagacaac aacagcaaca 
60001 acgatcagct aagcctttag ggtcagatag aaacatggaa ccgagattgc agaagtcatt 
60061 ctccgctaga atgcaactcc cctttatgct atcttcaaag ccaagcaacc agtctacgaa 
60121 taattcaagc tggtttagcc gtatcaagaa aatgtctaat ccattttcaa atcgaaactc 
60181 tctgatacca aagtcaggag aaatcaaggt cagtggagta ggagaaacac tctcaagaaa 
60241 caagtcttct tcacctgttc atctacatgc ccatctcagt atccaacatg aacttgggat 
60301 gcctgttttc accttctctc tagaccaccc ggatgatgtg tatacggcca ggacgtggat 
60361 ggatgttaac gactctcggt tcgtctattc gtttcgttac attggtggta gaagcaacaa 
60421 gaacctcgga gaacagaggt cgaacgtttc aggtatagac tcttcactta taggacagat 
60481 gcaagtttca actcaagtct ccttagaggt agaagaagaa ccatacgaag atcctgttga 
60541 atcagccgtg tcagagttcg ttctttttga catcgcacga gcacggagaa gtggactcaa 
60601 gactgaacaa ctgtcaagac agaacagtgt atctgatggc ttaaagcatc tgcaaaggca 
60661 gaacagtttc agcagagggt tgactcgtag tttctcaaaa cattcagaga acagtgcatc 
60721 atcatcatct gatccttggc cagccacgga tttacatcca ggtctagaga ttgcagcagt 
60781 cgttattaaa gactcttatt cttctagcaa taatgagagt tttgagtaca tgaagaacag 
60841 taaactctct agacgagaga tgaaggttat agttccatca ggaaaccacg gtttgcctga 
60901 tgctgaaaac tcatgtccta caccgatact gcagagatgg agatcaggtg gtggatgtga 
60961 ttgtagcgga tgggacatgg gttgtcacct ttttgtctta gaagaacaac aactcatcaa 
61021 caatcaccac tgtctagaac ttttcattga ggtatgattt attttcttgt agtagccaag 
61081 ctaaatgaat atctctgtgg acagttccag gattagtcgg ctgggtctcg gcccagcgga 
61141 caccatgggt tatccaaaaa aatattttat tatggattat ccaaaaaaaa aaaaaaaaaa 
61201 tcttgtggtg gtgattcaac ttgtctttct tgatcttgag atatacaaaa aatgtggttc 
61261 agggagagaa ggagaccaga ccagcaatgg caatggcatt cataagagaa ggtcattacg 
61321 aggtaaactt ccatgcaagg ctctcagctt tacaagcatt ctcagtatgt gtggctgagc. 
61381 tgcacagaac agaagtatca agaggagaaa gaagcgactc tttgtctagg tgcagttcgc 
61441 ttagagagct tattgacatg gcaactccgg taaacagaga tatcaacgaa gaagtcattt 
61501 cgtctttcat gcctaatgtc actttctctc caatttcaag ggtctaaaaa aaaacatatc 
61561 agaacgtaat atctccgtaa tatctccgta gtccttcctt tatgtgcata tatagctaca 
61621 ctcgacatat aataaagtta taaagccaga gtctgtgata caagtgcaga tgaaattagt 
61681 aaaatttaga actttagcag cacaaatgtt tgtacttact ttgtcctcta tgactcgaag 
61741 aagtcggttt gtttcaaatt ttatactctc acttgccacc cgtttcataa tatgtatcgt 
61801 tttcgaatct gttgtttttc tatcctctat atcataaaaa aaaagtagtc gtttacctaa 
61861 atttaaaagg aaattagtcc atgtcagttc agcttataga gtaattgttg tacatacttg 
61921 ctttggaaac aaatctgtaa taaaaattta gtacttcttt atatgtgtga aaactcttct 
61981 aaaatgatca attctaattt gaaatagagg aattgtgaat ctcactttac tccagtgtct 
62041 ttatcttatg ctgctgagtt ttaattgact cggtctagtt ttctttttat ttttcgtttt 



106 



WO 03/006622 



PCT/US02/22217 



62101 aacatagttt tattaaattt taatattata aagagtttac aagacacata accaattttt 
62161 aatttactta ttatgtagca attatgtgac ttgtgaatat aattggaaaa aacagtggtt 
62221 gaccatggtg tgaaaacatt gtcggcaaac aaaaagacag attttaatta tttttgtgta 
62281 taaaattaca atgcaaaagg taatcagatg caaacgaatt taaaaaccaa cgtttaaaga 
62341 tttgattatg tgtggaggaa gtttctagca acaattatga tttccgggtg cataattatg 
62401 ataattgcac acttgtgcat gggggtggga ccttggcaaa acttagcaca attatgataa 
62461 ttgcacttgt caatacatag gaatctatct tattaaaaca gaaacattct attggaccta 
62521 acatttattc tgtaagtttt taaattaaat acacttttat actttatagt taaacttaca 
62581 ttaaatcatt aatgtttctt tctttatact gctatctatg tttccaaaca atatatttat 
62641 ttctttatat tactatcaat gtttccaaac aatatatttt tatactacta tcaatgtttc 
62701 caaacaatac aataattaat cttagttatg ttatatctat cattttcttt ttaaaattgt 
62761 gtagaaacgt cataatttca taaattgtaa aataatgaac tttaaaattt ggagtataag 
62821 attacaaatt atgaaattat tacaatttaa atcaaattaa attacatatc ggtcatccaa 
62881 caattcaatc ggttaatctc gggttttagt aattttttta atatgaatat tttaaaaacc 
62941 taaattgaat tgtcagatct taaccggtat aatcacaatc gggttgaatt taaaaacagt 
63001 gatttaaatg taaaaatatt ttaaatacac acttttaaaa attaccaaaa tatttgttcta 
63061 gttattagtg aaatttttca tcgtaaaata ttccgcgctt ctaaagcgcg ggtcaagatc 
63121 tagtttaaat attattagta atgagagttc tattaagaaa aaaaaagtaa tcaaacctat 
63181 accaaaaaga gaataaatca aaagagaggc tgtccacgtt gacaaaaaaa tcggtcaata 
63241 gaaatagaga ttcacgccac gtcagatgtg tcacttaagt cgttgggctt cgaattccac 
63301 aggcccggca atatgttcat gcatctgtaa agtattggtg ctattttgtt ctcagctatt 
63361 ataaattaaa aaaatatatt aaatccctcc agtagatata gactcagagt agtaagccat 
63421 cgacatcttg ttgaaacaaa tgatctcaat ttggttacca aagttcaaac tctgccatag 
63481 gttttaaacc ttaggcagcc aattccttag ttatttactg ggtttcattt gggtcgagct 
63541 gactagtata ttttggtgct tacctttaaa cttttagtac gctatacata agtttcatat 
63601 atataaattt tcatttatgt gcaggtccat ttattaggtt gaatagttct tttgtaaggt 
63661 tttgtttaca aattcaccat gaaatttaca agtgttcaaa cacatatgga atcttgagga 
63721 ggcaactggt ctattgggtg ctgttattag actgccagca tggccatcct ttaaatggtt 
63781 atcatctaaa atcttaactt ctcttaactc cttatcacat tggaaacttg aaaagcaaga 
63841 gagtctatca acaagccggc cttccgtatt gataggagtg tgatctcctt ggatctttct 
63901 caatcatatg tggctatagg ggctttctta attggatgtc ctcctttttt tttttttttt 
63961 tttttttttt ttgatgtcct ccttttctgt ttaagaaagt cggtttgagt gtctttcttt 
64021 ttttgtttat gattgtgttt gatccaattt ggtggttccc tatccttctt gttgtattgg 
64081 tttttatggt ttgaagtaat gaaaggtttt tccagtgtct aagacaagtc ctgatgaaaa 
64141 aactgtttgc aacagtataa acaaacaaat atctaaagta caagataaat gtatcacaag 
642 01 ttcaaaacga tggtaaagaa acagcatata tatacatcga tactcaaaca aaaaaaaaga 
64261 agaaaattca aaagaaaaag atatatcaaa tagtagtcta catgaatgtt caccacacga 
64321 gaactatgtt gtaggttaac ttaaacgatg aagaaatata tttatctttt tgaaaatttt 
64381 ctagataaga gaagaagaaa gcatgtcact ctaccggtac aaatacaatt ttcatcggtt 
64441 cagaataatt tgttgcaaac actattttct agttaattgg ctgggtttct tatttgggct 
64501 tatttacttg ataataataa agcccacaat ataattgatc aggcatgttc cgcgagttta 
64561 aaaacactgg cttaaaatgc aaggaaggaa taaaggatca tatataagat ttaggttcat 
64621 tgacttcagt gacttgagta actgagatcc ttttttaccc cagttttggt gcgaagaaac 
64681 ttactgatca agcatacata acttgagccc ttgcacattt tagtactagc tagttcatat 
647.41 cacttgtctc tgtatctggg atcttagtgg gcatgtgcta agcccatctc caacacaacc 
64801 tcacaaattc aatctttgag agatgagtag aggtatgaat agaagtggct gatatcgaga 
64861 gagaatgcct gataaagagt agaagaaaca aggttaatta actcctttaa aacaaaatta 
64921 taaaatcaca atatattctt cttgatctaa aacacgcaaa agcatgcatt gtatcagcat 
64981 tctccaatac atatgacgct ttcgttgtga tccatagaca tcaaatcgaa acgtgtgttt 
65041 atctgtgtcc acacaaaacc agctgtccga tttacctttg atttcaatct tgctcgtatc 
65101 tttgcttttc agatcataac agaaaaaatt gtcattttga gttgcactta taacctttaa 
65161 tgtaatacca ttgatcaatt gtagctgaga aggctacaat aacagagtct tcctgactat 
65221 tccttgtttc caaaacccat aaatccacca aactcttgct ctaagattgg tttggtcaag 
65281 aagagttact tctcctcaat attctataag agctcccatt cgtgacattt gtaatcagct 
65341 ctctccatcc ttgtggtcca tttttattat atggctttat atcccctttt ttcttaatat 
65401 gatagctaat ttaaagcaca ttttgaaaca atgcattttg tgtgagtcta taaaagagtt 
65461 tcaatttaaa attctacaca atataatttc atgatttcct ataacttttt ttttgtgcaa 
65521 ctcaattgat taaaaactta gatgggttca tacaattggc ccgataaacc cattggagaa 
65581 agcagattag gctagtctat ctgcaggccc atgattttct ataactattt taacataaga 
65641 tacgaaaatt cgaatactgg ttattcattc ataacattaa ttgtaaatcc agccaagcat 
65701 gccatagttt cagttgaatg aaattaattg taaggttgct agctttaatt atattagttc 
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65761 ggaagatata aactatacta taagtgtagt ctattgtttt gaatagagaa aatataaatt 
65821 tgttttaaca gagtccttct tcccccaaat aaatcattaa ataccttcac atgggaaatt 
65881 aactcacact gcaaatatta atagcttgct tttggtgtga agctacgtga tattgtagtc 
65941 ataacattta cttcgacatg gccatctttt tactagtatt ttataagttt cttcacaaga 
66001 ttttgtaatc ctgtgattaa cattaataca cattcaattt tcttcctgct tcgattaaga 
66061 ccctttactt gtacgttggt ggtctacata aagacgtata catatctgtt ttcattaaat 
66121 acaatcaaag actattcgtt ttcttctgtc aacaaatatt attcgtaatt cttccttctt 
66181 gataaatact tgtaccaaat gaaatttcgg agaaaattaa agaacactga caaagctagt 
66241 tacttaattt aggactcagt tttgttttat agcaaatgta ttacgtgttc gatttttgta 
66301 tatataatat taatattttt ttttggaaaa taaattttgc atactatgaa cacgtaacat 
66361 acatacaaat atatcactgc attttaacca tatttgataa acacatttaa aaacctacga 
66421 aaagttcaac aaaattggat agtagttaaa gcatttacaa aattaccaac tgtgaaccaa 
66481 gctgaggttt gtttagtgtt tgtgctatct gattttagca gtgtagattt aagaattcac 
66541 gttaataaat atagcacatt ataatcagta aaaatatttc aaaactgaaa acattaatta 
66601 aaaaaaagag actaaaaata atttcaaaat caaaaccacc gactaaagtt tgtagtatqt 
-66661 tggtttgata ttagtattat catcatgcca ttatccaagt ataagaaaaa cataattata 
66721 aaatgtatgc tttaatatta aacgatttgt caacattgcc atttaaacag aaaaagtaga 
66781 attgtttttc ttcaaccggt agttaagcag aaatagttgt tggaagtaaa agactagtct 
66841 acgtagctgt catagttatt tcacctatcg taattaaact gagccttaat gaaaatcaat 
66901 ctctctctct ccctgaacag tggtctatgt gtatgtgtgt gtgtttgtgt ttataaaacg 
66961 gccttacgag aatcaacaag agtaaacttc tcaataaaac caaaatctct ttcttaaccc 
67021 ccaacgaaag aaccaatgag ccagaggaac aagaacggcc ctaagctcga gctgaagctt 
67081 aacctctcgc cacctccttc tcaagccaac ctgatgagtg tagttcgttc tccaagccqa 
flit 1 tccaacacaa cttcgccaag ctcatgcgtt tcatctggaa cgtaccagga ggagatggag 
67201 acaacaatct caatggtact tgttggttgc cctcgttgcc ttatgtacgt tatgctctct 
tilt 1 caa 9 a tgacc caaaatgtcc aaaatgcaaa agcaccgtcc tactcgattt cctcaaccaa 
67321 gacgcctccc ccgctacaac tgctcctgcc gccaccactg aacgcaataa gacatggtgg 
67381 aaattttttg tttaatccaa ccctttgtcc gaaatataaa cctataggct agttgatcca 
tltt 1 a = c 9<: a cttg tttctatata tttcttttgc ttcttctcgt ggttgtttgg tgttagagta 
67501 gtgctaatca cgctccacgt ttagaccaaa cattagcact aactttgctt ttaatqcqat 
ac 9 aa attat ttgagcaaat tatgttcagt atgatgaaac attcgcactg tgtgtctatq 
67621 tgcttctaaa cctttgcacc ttggagattt gaaagactaa tgtcctctgt ttcttaaaaa 
67681 aattacaaac ggtgtttctt gaattaataa actgtaaaaa gaaacattcg agattcggtt 
67741 ggtttgttga acataggaga tacagagaaa acttattaca acatttatcc cggttcgaat 
IVLI\ ^^ tt9a f actaaattct agcaatgtga aaagtttgtg tttgaattcg acccaaacat 
Hit, "tataaagt a 9 tct 99 ta a catgaaattg cgtgtgacat gacattaatc agaatgtatt 
III 1 ttaaattcaa attaaaacaa tatgataaac agtccatttt ataatattaa atatattttt 
67981 ttatgtgtga tcggaccaac tgatagaata tattttttaa aaagatgctg aattttttaq 
ll.V: 9 a 9 aaatttc atatatacac tttttgagca aatgatgttc agtatgatga aacattcgcg 
? ?5? t ?5 Ct at 9t9cttct aaacctttgc accttggaga tttgaaagac taatgtcctc 
tgtttcttaa aaaaattaca aacgttgttt cttgaattaa taaactgtaa aagaaacatt 
looo?" ^agattcgg ttggtttgtt gaacatagaa gatacagaga aaatttatta caaacattta 
tcccggttcg aatctcattt gaaactaaat tctagcaatg tgaaaagttt gtgtttgaat 



HIV: £1 l S l ca "ttataa agtagtccgg taacatgaaa ttatgtgtga aaEgacatta 
HV£ atca 9 aat 9 fc atttcaaatt caaattaaaa caatatgata aacagtccat tttataatat 
68461 taaatatatt ttttacgtgt gatcggacca actgatagaa tgtatttttt aaaagttcat 



Jtll^ taa 9atgctg aatttttagg agaatattaa attcatttag gtacctaaaa acatttttaa 
ZlJtl aaatacctaa atcattaata ggcaaaaaat aaaaataacc tctcataatt tcaatctaat 
«™ ta ^ at 9 t " c tttctattcc gataaaaaac tttgataaag acctagttga aatggttaga 
llZl\ f fttt t aactaccata atgtcaaaaa aatctataaa atcatttggg tttgEgcaca 
foZo^ ta 9 ttttact caccaaactc attacgtttt acctaatatt catattgaaa agaacgacca 
68821 aagaacaaca atcatttgca tatattcttc ttttttggac caaaccatca tatatctaaa 
caacaaaa 9 a aacgaaaatt gtatttagca accctaatat gcgcatattc ataggaaaaa 
^ ctaataggct ttcagacttt agtttgtagc tactgataag tctaaatgct gaattattat 
tln^ ata "aaata caaaattaat gctaaggggt tgaaagttgg ttaagattaa agagaggtag 
llltl " Ca ^ a9at taaaattaa * taattcatta aaatlaagtt taaatgcaaf 

fi^ifli tl I 939 9 ^ agttgaaa ataatgatcg ttacgtaagt atgataaata cgaatttaat 
Mll\ tcctccccta ataaaattag tataattttt attgtcaatt tttttttatg ttgtgtaatt 
Vslol Illicit tt-^"^ Cttctcttac cagacattta tcatcttaal ttStlactaa 
HXVl III 9 I "acttaatg caactgtcgt agcaattaaa cgaagtagat ctgtttgggc 
69361 cagaccaact tgcacagcct ctcctaatca gcccattaat tgttataaac gcttttHtg 
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69421 gtcaggaatg taaaatcaca gaccggttta aagggtttaa ggaagaagag accgtgaaat 
69481 tttacattta ctggagatgc ttaaatcggt tcttgatcag agtggctacg tgctgcgcaa 
69541 cccacacggt gccagccacg gtgcagcaga atccatgagt ttgaccttcc ctctcgatag 
69601 ataaggaagt accaggcgcc tgctttgaaa tctgcacagt ggagattcta agtgagacga 
69661 ctagaaaaaa aaactgttga atacgagaat gcgcttctgt cctttttgct ttgaacacag 
69721 ttcccatgaa aagcagagtt agaatcaatg agttgccttg tctttgcaat ctgctgatgt 
69781 ttctgagaca agctaagagc agcatcaagc tcaagaagcg gttggttgcc tccatttaaa 
69841 ctgagaagtg actataagta gaaccggaag aaacacaagc atgatcgatg accaatcaga 
69901 agattcacga atagcctaac cgatcaacga gcttctctaa cagttcaatc agtctttgct 
69961 tttcgtcgac gagtgttctg tcgtctaaat aaagatacgt agcttattag attaattcat 
70021 ttttttcaat acctcgttaa atttaagaga cgtaataaga tgcgccaata aacatgctct 
70081 aaagcaggtt tcagacagta atcataaacc agaggaagtt tctaaagact agaactattg 
70141 gctgcgattg atgactgagg attagaacct tctccacccg atctgagata ccaccgcctc 
70201 tgaagttttg ttcatcgacg caattgaaac aaccacagcc ttagtccatc cgaggcgatt 
70261 gaatagtgat caacaaaaaa gctcctcgcg ctgccatcat cggggccggg ggtgatgaaa 
70321 ccgtagcctt tccggtcaat ctcgcattaa gccctaattt cgactttgac cttagatccg 
70381 ccacgtcagc tcccttctct accaatcaga ctttactttt tcgtatttaa tatttttttt 
70441 atattcgcta cgatttagac ctgtttattt tattatatat atatatatat atattttttt 
70501 ttttatattt taaaacagca aaatttatat attaagattt ttgactccac gtgtacactc 
70561 caagtattgt tgagctggac aaactgaaac aaggcagatt tttcctcgta gtttcttggg 
70621 atttgataat tatgtaaaga atagtctgtc aaaaaaaaat tgatagcttg cactatctat 
70681 gagcattaga aaaaaacgta cctaagaaaa tttgtgtatt tttgaacaaa aaaaaatgtt 
70741 gtattttttt ttcactaact aagaaaattt gtttattctt taaaaacgta cataagaata 
70801 tttcatcact tccgttgtca atcggacatt gcatacaaag agaacttttg tctgtgtcga 
70861 gtttgttttt ggtgtgtaca taattgttaa atcagattta gttttcttat tttcttaaca 
70921 atataatata acggcatcaa atgtaaatag aataaattta gatggtttta tgtattttag 
70981 gacctattca agcaatatgc gggtttaatt agagttagtc taataatatt tcagtgaata 
71041 aagaagaatc taaataattt tctcacagtt tatacgaata ataaatccta aaataatttt 
71101 tcaaacttga atggattact ctctacacta aaacatgggt tccaacctga atggaacttc 
71161 caagaaagag gagagagact gtcagacggt ggggaccggg tagcggacaa atagagtatc 
71221 tgatacttag attgttcaat ggtgacaaaa tgaagaatta gagtatctga tacctctttc 
71281 tacttaaagt gtatctgata gttttttttg tttttgactt aaaagtgtat ctgatagttt 
71341 tggatgtttt tttgttctgg aaatgtgtat tttaaatttt gaaattttca tctgccacaa 
71401 aaaactgtac agtctctatg tttcaagcta gtatattttc aacttataaa tatattattt 
71461 tcaaactgat attgtttaat atcattgtat taataataat taaatatatt taattatttt 
71521 tattcaaagt ttaaatttat tatcataaaa taataaaata aaacacaaaa ttttaattta 
71581 aaaataataa taatttaaaa tagaacattt taaaaaattg cagagagtgt aatttttacg 
71641 ggacttagag agatattgat atatatcaca tgattaaaat tagattaaca ttagatacat 
71701 agaaaattct atttttaaaa agaagatgaa gatcttatct ggaatttttg gtttaactac 
71761 attaatagat ttttaaattc aaacaaaact tggtactgtt cttttatgat tttaaaattt 
71821 actttaaaac aatgtgttcc tattctgtga tttatatcca attcaaatca tatatttttc 
71881 aatcctatag aaataaatac ataatatatt ttgtattgta tttagaagta atctttatta 
71941 aaataagaac taaatccgag agattttttt tcgatatgta aatattaatt aaaattcaca 
72001 tattagattt gaatatcaga aaatgtcctt taagagcatc ttaaaattga ctctttattt 
72061- tagagtttgt gaaaccctat atttgaaatt tcaagatatt tttttccaaa agtaaaactt 
72121 caattttttt ttaaacttca aacttaactt caaaattatt tgtaatttgc tttatagtct 
72181 ttttatttgt cataattaat gtaggtccat aaaactttat aaataactaa catataaaaa 
72241 catattacaa caatattaat taataatatc taacacaaca atatcaaatt agtaatataa 
72301 atacataatt aaatattaaa atacaagaaa atacgacatt attacctaaa actatttatg 
72361 taatgctcta tcttcggtta ctcaaaattt gtttggataa taatttagag atttgagagg 
72421 tttcagagca aattttccag acattagtgt tgtaatattg aaatatgggt aataattttt 
72481 ttcttcatgt aacttttaaa aaaagttatc aagtttcttt tgtaataata tatatttttt 
72541 aattttttta atttagtttt actataaaac ttgaatttat aaaaaaaaat ctgaaataat 
72601 tatgaaatat gaaaaattgc agattaaaag gataaacaaa aaaatactta aaaatcataa 
72661 atgtatgtgt aattaattgt aaaaatcaaa atgcaaataa aaattgaaac ttctttgatg 
72721 agtgaaactt caatctcaaa actctaaatt tgaagtttta aattttcttt ttatagagca 
72781 aaaagctcta tattttaaag ttatagaatt gtttttacag atgctctatc aatcccactt 
72841 tcatctttcc ctcacagact aaacactctc taacaacttt gccaactcca ctatactaaa 
72901 ataaaaaagc aacagcgtgt gtttagaatt ctccgtgcgt gtaatgattt cctttttatg 
72961 tatgtcaaaa aagaagaaga aagatatgta cttttttttt gctaaaaaga tatgtacttg 
73021 tttgacaaaa aaatgtacat tacctcaaaa acaaatgcat taaaagattt atataaaatt 
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73081 gaaacattac gcatgtattt tagattttaa tgcatttggt tttgagataa tgtacatttc 
73141 actgaaatat tagaaacgca ttaaagaaat gtataattta tataaaacaa atgcattaaa 
73201 gaaaaatata aaactgaacg agaacctaat gcattacaga aatgtacatt accacaaaac 
73261 cattaatggt taaatttgtt tatcagcgaa gttttttaag atcatgtcac ttgaagacta 
73321 atcaatcata tttgtttact cactagagag accaatgttc cattagtgag gttaagagag 
73381 tatctaataa tttttttttt aaatacaaaa ttaagagatt gaagacggag atattttaca 
73441 actgttccaa tttttaatac tcaaaatcat atttactttt catctattgg tccatctatt 
73501 ttaaaataaa tcgaaattaa taacaacaat aaaatatcat tatttttata ttttaggaac 
73561 taaaaggatt tctcatcaga aaaggtttct catctaaata atttttcata atacacacgt 
73621 ttcagaatta aaaacctcta ttataatcta gagaatttga atggtttgaa gaaagaaaag 
73681 agaaattacc ggactgtttt gaactcaatg attgacccga cgctgaggaa ataaagaaca 
73741 tataaaacaa aagaaaacaa aaagagttag tgaagtaaaa tataaatgaa agattattgg 
73801 aggtataaga acacatggta cataaatcta taaaaaaatc ttctttgtga atgatatttc 
73861 attaaaatca gaaccacgca ataagagcgc ttccatcgat ttaaccgatt taatattgat 
73921 agttttaata— tgatttaatt attgattaaa ttttaaatca aaaacaaaaa tcaaaccaat 
73 981 catataaaga caaacacaaa gacaattcca actttttttg tttgttggag tttaaagcac 
74041 aatctctttt ttttttcttt ttgtccactt ttctatttct tgttactttt taattaatat 
74101 ataaatagtg ctctgaaact cttctagatg tccttaacaa taacgaaccg ttcaaaattt 
74161 taaaattgta aataccagtt agaccctatt tatttatttt ggttcaactt aaaccattat 
74221 caatcttgcg aaaaatattt gtatcacatc aacttgtttt agttagtgtt attaagtcct 
74281 tatttgatcc ctctttctat aattataaaa aaattattct cctctaagcc taataaattc 
74341 cgatctgaaa taaattatac caaaatcgaa ttttttaata aaatataaga atggatcaaa 
74401 aatttaattg tcaaataata gttcaaactc aatctaaacc aaaatatttt ggatatccaa 
74461 aatacttgaa tcacaattat atatatattt gaatatacta atatttgtag atttaatatc 
74521 caaatacact aaaatatttg gaattatcca aaacacctga aaaataaaaa atttatccaa 
74581 aaaggtaaaa ataaatatct taaaacaact aaaacactca aaacattaaa aatataagta 
74641 atatataccg tttctccacc caaatatcca agccgaacaa attttcatgt caattttaga 
74701 tatttggtta tacaatattc atatttatat attatatatg gcattatttt tggttttgag 
74761 attttaaaat agatttggat tattgttgtt tttacataat ttaaatatgt atctgaaccc 
74821 aaaccaaatc tctaaataat ttaagttatt tacgtatttt agactcttga aaagaaaact 
74881 tgtaattact taacatttag atttataact tcaattaatt atctacagaa aatttgcatt 
74941 caaaattagc aagagtgtgt gatctagttt gcattataat taaaatagta tttttctacc 
75001 cgtacatagc aaaacaaaat ttaaataaaa catttcaaac taattcaatc tagctacttt 
75061 tggcaagcta agctctgttc tttctcttcc tgttctctgc aaacaaatat cacgagagtt 
75121 gaagctgctc tctctctctc tctctctctc ccctctctct ttccggattt cttcccctct 
75181 tcttctctct ttcgctttgg atattttgat ctccaatggc tgccaaagcg tttgcttaac 
75241 gaacaggttt ccctcccttc tctctttcac gtgtcttgtt ctggtgttct ctctctctct 
75301 ctctctctct ctgtgtgatg gagatcgagg agctcgagct tacacctctg agacagcctt 
75361 ctccggaaat gaagcagagt tgggttctca aaccaacact ctccaagaaa aagaagaaac 
75421 tttctgttgc ttgcatctca agcaacaaaa ggttacttgt atggcttgga ggagtcttgg 
75481 ctgtgtcgtc tatcgtcgtg ataacactgt tcaagacact cccccaccac cagagcattc 
75541 caccaccgca agacaatagc accatcgcac ttcctatggc attgaaattt ttcaatgccc 
75601 aaatctgtaa gtagagatgt tttgaatcaa agtatatgac caaatgaatg taaacgttac 
75661 tcttttgatg ttctagccgg aaaactgcca gaggggaata acgtgtcttg gaggggaaat 
75721 tcttgcttga acgatgggaa ttttcccggg agtctttacc cacatctggc gggagggtac 
75781 tacgatgctg gaggttcgat caagacaagc ttcaccatgt ctttctcaat gacaatgttg 
75841 agctggagtg tcattgaata tgggtcgaaa tatgaggctt gtggagaagt ggaccacgtc 
75901 aaagggctca ttaaatgggg aaccgactac ttcctccgca cttttagcag cagttctgat 
75961 acgatctatg aaatggtgta tcaggtataa aagttttgtt tcccttcctt cttgcatact 
76021 caattcggtt caggtcctaa acttttgttt ttcttttcac agatagggat gaatcaagga 
76081 agccaagtga ctagtgacct atactgctgg atgcgaccag aagacattga ttaccaaagg 
76141 tctgtcaatc tatgttacac agactgcccg catctcgctg cagagatggc agcttccctg 
76201 gcctcagcct cgatcgtatt ccgtgaccaa gtcgactact ctgcaacact tgttaacagt 
76261 gccaaagccg tttaccttta cgccgaggct atgagcgcca agcggaaaag cgcagatcac 
76321 tgggatgacc tcatatgggg gggagcgtgg ctctactatg ccacgggtga taactcgtat 
76381 cttgctaagg taaccagtca cgatctagcc aaccgtgccg gtgccttttc ccatggccct 
76441 cgttatggtg tctttggttg ggacaacaag cttgctggga cacaggtttg ttagttcacg 
76501 aggcttcctt agttctgttt ttcttccaac tgattgcttg cttgcttgct tactctctgt 
76561 ttttttttct gtgtgccttt tttagttgct tttcactcgg ttgaggctgt tcttgagccc 
76621 tcccttcccg tatgaagaga tgctcagggt ctttcatgag caaaccagca tagtaatgtg 
76681 ctcctacttg ccgtattaca ctaagtttaa cagaacaaaa ggtatcagat ttttgccttt 
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76741 gttattcttt attactaagt tttgattgtg tgtatacttg tgaaagtgaa actattgggg 
76801 agtgagtttt tggtcttatg tgaagggtta tttagttagg atgttttgtg aaggattatg 
76861 cgacttgatt tagtcaggat gttt tgatct tttttagtat tgagagtgta tatcgtctgc 
76921 tttcatatct gcaggtggtt tgatcctgct gagcgaacca gagcctctcc agtatgctgc 
76981 gaatgcagct ttcctggcca ctctgtacag tgactaccaa ggcgcttctg acgctcctgg 
77041 atggtactgt gggccaactt tcttcaaaac tgagatccta cgtgactttt cgacatctca 
77101 agtattgaaa ccattaccat tatatgatca tctccacagt tcgatcttat ctgatttaag 
77161 cgtttgtgtg tttcttcttc acgcaggttg attacatact agggaaaaac ccacacaaca 
77221 tgagttatgt ggtggggttt gggcagaaat atcctaaaca tgtgcaccat aggggagcct 
77281 cgatcccgaa gaacaagaaa gtgacatgcg aaggaggttt gaagtggaag gagagcacga 
77341 gcgagaatcc aaacacgatc gaaggggcaa tggtcgctgg accagacaag aaggatggct 
77401 tccacgacgt acgtgtaaac tacaactaca cacaggcgac tctggtggga aatgcaggt^c 
77461 ttgtggcagc tcttgtagcc tcttcacgag gaggtggagg atttgataga aacaccatct 
77521 tctcagccat aaatcctctg tcgtttgcac cgccgcctcc agtacctgag acctgagaag 
77581 atttactctg tttttatgct tttattattg gacttgtttc aagatatttg ttgtgatttt 
77641 agctcaggtt tgtatctctt ctcttcttta ctcttgaact ctctttttct ctgt.tttgtt 
77701 . gtgcaagact tgattttacc aaacaacttt ttcattcatt cttgcataca ttcttacaaa 
77761 caaacaacac acactcttta tagtagactt cttgctgact tacaactaac acagcacatg 
77821 acacatacct ctagtcttta actattacaa caactctctt ctcaactctc taatcatatc 
77881 tgaacttagt tggcttacta accaactaac tcttaacttg ctacttaatt ttaacagcta 
77941 gttacataac taacttcctt gttgaccttg atctttgact tctcttcttc cttgcttcat 
78001 ccacagtatt tgcttattct tcagaacatc tcaaagtcat cgccatcttc ttcttcctct 
78061 tcttcttgct tttctatcat cttctcatta tgatttttgt cttccttttc ttttgcaggt 
78121 ttactcaagg aatcacttct caacaatatt gtttccactc aaatctgttc cttcactgaa 
78181 ggagacaaat ttataatctc ttctcaatat tgtaacacac aagtggtagg aaagacttga 
78241 aagttgatct ccttgtaatc cctgacgtct ctcagagcat catatctcac actcagctac 
78301 tattttttct tcaaccttgc ggagagtttc aagttcagaa acctgactat gtgtcacatt 
78361 ccaaagctat cttagtgatt tcatgactat aatgctgatc aggattctcc tcagagaaga 
78421 tctcccacaa tgttttcctg atgttttgat tcgatattgc ttccttgagc tctatttagt 
78481 ttatcactcg caatgtagat gccgcttcaa aagcttccag agcagagtag ccttgcagag 
78541 caacatggcc ttacctgcag cctctctcac agccatttga gtctcatcaa gagtctggtt 
78601 cattccacca aataacaact taccacaact tgacaactga catagcaacc tgttaccgtt 
78661 taccaatcaa atgacattat ataatacttc ctattcaagt atgaatcccc tttttcttag 
78721 agctgatgtc tctctcggtg accattaaga aaccagtttg gtatatgaaa atcaatgata 
78781 atgtcacact taacaagaga gaaaatcaat gacaatatca cacagagact ttaaaatata 
78841 gccaaggcat caatcaagat caagcaagtt cggatcttca ggcaaaaccc taaattcaac 
78901 tacaaacaac cgagatccat ggcggctttg atctgtagct atgaaaaaat cgataggaga 
78961 gaggagaatg aaagagttag ttattcttgg ctttgagttt tgggagggaa ataaagtagc 
79021 cgttacggta tcaacggatc gtttttggaa ttttcaaggg gcttatctca tgtgtgcggg 
79081 tgcacattag tattaacata attgttaata taacatttta aaacaataat tttatttatt 
79141 attttgaata atcatatttt gtgattttaa tcatctatta tatcacagtg taacatataa 
79201 aaatccaata tttataacta attgaactta tattacgaaa gcgttatact aatgatttat 
79261 gaatattttt tgtactaatg ataatatata tatttgtaaa ttaatatgta ttacataatt 
79321 tttaatataa catttaaaaa acaattttat ttattaattt gaataattat attttgtgat 
79381 tttagtcatc tattatatca tagtgtaaca tataaaaatt caatatttat aactaattgg 
79441 acttatatta tgaaagcgtt ataataataa tttatgaata aatgttttat attttattta 
79501 tatgttatat aaattgattt tgatgtgtgg tttttatttt atttttttat tattaataaa 
79561 tattgaaatt aatatgttaa taaaaatcta ttaggaaatt tattttggtt aatattcgat 
79621 taagaaaatc tattatttaa attagggaag acattaaata cttaaatcta ttatttaaat 
79681 aaggaaaaga caaaacctcc ttaaatataa cttcatttaa tatttgagtt gcatgacatt 
79741 ataaataaaa taaaaaaatt caaggctaat tcaattttat acttttattt taatagtata 
79801 gatgtttgca tttttcaaag cctctaaacg agactttttg atggctttgg aatttttttt 
79861 tatttttttt ttaacttttt tctcaacaag ttaaacactt aaaaataaca aaatatttaa 
79921 caaaattaaa aataaatcca ataaaaatca aaataaatct agataatatg ctatactttt 
79981 tttaaattaa atatttgttt ataagataat gtattaatta atttcgtgat tagaaatata 
80041 atgttatttt aatacataaa taaatataat aattatataa aaataaaatt atatgctata 
80101 ttttttaaaa gctttataat tacagtgtat agtatattaa attagaaatc ctctatgaaa 
80161 taaatgtaaa attttaaatc atgtaatgtt tttatacaaa caaaattgct gaaatatatt 
80221 tggtcccaag ttgaaccagt taaaaatatg ccacaaatcg ataaagataa aatatatatt 
80281 aaaatatata caaaaacggt ttgtaaatta ataattctat aaattaataa aatatcatag 
80341 ttacaacact attaatttat agggttttta ctatatttat aacagttaca atttctaaat 
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80401 cttttgatgc cagtcaagct ttaaattttt 
80461 aaaatcccta ctgctaaaat tctaaagtca 
80521 agcttcaata tttttttttt ctagccgtcg 
80581 tattctagtc tccttttcct cgcagtccta 
80641 gggagtactc cattcaaatt ctctaaatta 
80701 aattttgaat ctagtgtaaa ttatgagaaa 
80761 tattactaga ctaacctcat taaactcacg 
80821 aatcttctaa aattatttta tatacctttg 
80881 tttgacttat ttgtcatctt cttgatttta 
80941 tttatttgtc atattttcca aaattttagg 
81001 ttttagtttt aattatttat catattttcc 
81061 aataatttta tcggtaattt ttatatattt 
81121 ttgggtttat tttataatca attgttaatc 
81181 atttttatta ttttgagaat aattaaaaaa 
81241 tatatttaat atattaattt taaaatatta 
81301 aaatcactaa aattacttat taagatttat 
81361 tacggagaga atgaaattta. atattactaa 
81421 ttatcattag attttttaaa atattctcga 
81481 aaattattta ttaagatcga taaataatta 
81541 agcatgcaat ttaataatca atcaaatatt 
81601 atatatttat tacgtaaaat aatcttgaaa 
81661 tgatttgtat ttattttaaa aatagtacgc 
81721 gaatcgtttt agataatatt gattataaaa 
81781 actatattat atgatatata ttaaaatata 
81841 tgaatagggc gaaccactcc gtggatattt 
81901 ctcggtcaac tggaatgtgt attatccata 
81961 gacctgagac gaagaaaaat gtctatctat 
82021 aaaatggata acgacaacaa tttcatacga 
82081 atattcttca ttaatggaaa ttcttttgat 
82141 tttaagaatt cttgtttaga cagttcgtac 
82201 ccaactcttc catgtttcca tcgtagtata 
82261 gtgtttctac aattctacca cccagtcaat 
82321 gttacattac atgaatcata aacaaaatat 
82381 aatataagat aaatttaact atgatattaa 
82441 gcaataaatt atcacctaaa tcagtaaaat 
82501 atatgactaa acctaaaatt atggagagta 
82561 tttagggtaa ttctctcaaa taaccatttt 
82621 agtaaaaaga ccaaaatagc ctcttcttat 
82681 tttgaaaccc tattccaaaa cccacctctt 
82741 tagctaaccc ttagggtata aatgtatatt 
82801 tttcttcctt gatagctatt tttatgaaaa 
82861 ctctaaaatt tatatacaaa atttt tttat 
82921 tgaaaacagt acatattcat aacactatat 
82981 atgttgtaaa atatttttgg ataacataat 
83041 aaattaatga aattcgttta aaatttatgg 
83101 taaattagat tcagtttaat attaccaaaa 
83161 cacacacaca cacacacaca cacacacaca 
83221 cacacacaca cacacacaca cacacacaca 
83281 tgctcataaa ttttataatc atcattattt 
83341 agatattaca tcgtaatatt ttaaaaatta 
83401 atatgaaagt tttcaagttt attttccgta 
83461 attgtatata aaaataatat ttaaataatt 
83521 tattttaata gtcttctgaa tttataatat 
83 581 agcccgtaag tgcgggcaaa acacctagtt 
83641 atataattga aaacaaggag aaaaaaatag 
83701 tattgtaata gagtttaaat. ctaaccataa 
83761 tctattttaa atatagtgtt attcaatgaa 
83821 ttattcaatt tttaaggatt gtaatttttt 
83881 tctaatgaat aaatgagtga actcaaaatc 
83 941 tgaactttaa attttaaaat atgatggatt 
84001 gtttaataat agaatgactc taaatttatt 



ttaaaagtaa cagctacaac ttttaaagtt 
tagtctaaag cgaaaattgt tagcaatcag 
gataattttt gaattcaaaa taatatcaga 
caatctttct tcgaacccat gtttcaatta 
ttttagtttt tttctatcag agttttctta 
attattcaga ctgatcttaa ttcactggaa 
tattgcttga ataggtcaaa catacataaa 
gcactatcta tattattatt tatgtagtga 
tattttttaa ttatttttca ttgattgttg 
taattttctg atttcttatg tttttaataa 
acatttaggt tatttatttg tcatgttctt 
attatattat tatataattt aggtggtaat 
ttaataagta atttagggat ttagctgata 
ctaatgatag tatcatattt aaatttatat 
tatatataag ataaaaaata attatcagct 
tattgattat aaatttaaaa aaactaaaat 
agtaaatata agataaattt aactatgata 
gaaaacaata aattatcagc taaatcagta 
taaaatatga ctaaatctaa aattatgtag 
aaagtttaca tgcaactatt ttaataaaat 
acattcccat atatataaaa ctatatacat 
tgtaaaaata tttttggata acataatata 
ttaacgaaat tcgttttaaa tttatggata 
attagtttaa tattactaaa cgtgaatata 
ttatttcaaa acaatataat tagaattaga 
taggggattt ttcaattgag aagatctatc 
tttatttagt tattcagttg attcgttatt 
catgcgtatt tttgattttc caatggattt 
gtagtgagta atagttatgg ttgttcgctg 
catccataca taatgttttg atctaatatt 
ttgttccatg gaactaagtg gaagaaacat 
ttcggctaat taatgagaaa cctttctctt 
aatatacaca tacatatatg tactaaagta 
aattagtttc tttaaatttt tctcaagata 
tatttattaa gattaataat taatcataaa 
ttaagtttaa taattaatta aacattaaaa 
aagtttttgt cacgaaaata gcactcaaaa 
tttaaaaatt tttattttta ttttttaaaa 
aactctaaac tctaaactct aaatgtagat 
ttacccttta ataaaaattt atttggtcat 
taaactaaaa aggattatct aatagaattt 
caaaatttat tttatcatgt aaagtaaact 
acattgattt atatttagtt ttaaaatact 
atagaatctt tttagataat gatgattata 
ataattatat atttatatat atacacacat 
ccaaatataa tataaacaca cacacacaca 
cacacacaca cacacacaca cacacacaca 
cacacacaga tatatcacaa aatatataat 
aaaaagattt taaaattatt ttatccaaaa 
aaataaatcc atgtatataa ttttatgtat 
ataaaagata ttttgttgaa attaaaagtt 
attaaatatt ttcaaaatat aaaaataatt 
atctatttac aaaatttggt aagattatca 
atattatata attgaaaaca aggatatatc 
ttatttatta gggggtttta ttcgttgatg 
tctactgtta tttgaatgat gatttaacat 
tgatttaaat tcagttttta aaatctagtg 
ttgtattttc aattgatttc aaatcatgag 
aaaggtacac taaagagatt ttagaattct 
gaaaatcact gatcgaataa cactacctaa 
ttaattacta gtttagattt catgcatgtt 
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84061 atacccttaa gttcattgtt ataaaaaata gtaaataaaa atttgtcatt actcaaaaac 
84121 ttattttatt catattttat gtagccacat acttatcttt atttttcttt ggcgatagac 
84181 ttaccttaaa atcgatagac aattagattt tcataaatat gttatgtttg gtaaaatatt 
84241 tattaaatat tcatagattt aattagattt agtttctttg accacatatt tactaatttt 
84301 gatagatttt tttttgtttt gaccacctat ttacttattt ttgacagatt tatttggatt 
84361 tcaactgatt tattatcatt tgaccataaa tttaattaat tttgacatat ttaatatgga 
84421 tttgatagat ttatttattc tttgaccaca tgtgtaatta caatttgata gatttttgta 
84481 tactttggtg acagatttta tagttattca gtcacatatt aattaaattc accttaacca 
84541 aactggttat tgaaaactgg atctaaccat taaattctaa aattttatat aatttaaatt 
84601 gaactaaacc aaaatatttc taaccaataa attaagttaa ccaaatcata acaaaaactt 
84661 aatttaaacc aaagaagaaa aaaaccctaa tctacaaaag aacatcacgt gtcattcttt 
84721 agcgagagat tacgtctcct atgtctttca ttgttttcgt ttttcactca tggctctggt 
84781 aaaattgttt gtggtatctc tggaatgctt cataatcaag aactctcttt gcacactcta 
84841 acgggaggca aatccttatt tccctatatt tatttcatat gtcaatctct acagggcaca 
84901 tttttttaat tttatatatc aaacaaaaaa tcataaatta aaatggaaac^gtgaatat tt 
84961 tttatttgat ttatagaatt tgataaaaat atttcaaaaa tcaaaaaatt gttttttcat 
85021 aatttcgcga aaatcatcaa aatatgattt aaaacgattt taaaatatat tttttccaaa 
85081 tttgaaaggt atagaactta aatatttgcc catggtttca ctaatccata gggggtgtta 
85141 gggttttgtt ttgaagagag aagatgattt gatttgggag tttaagtttt ggtcaaaaaa 
85201 aagaattatg aaaaaaagat ataggagatc aagtgaatcc attgaaaaat tgtctctaga 
85261 gaaatagggg aatgggaatg ggaaaagttg actcctctac cagaatatga cgtgaggaag 
85321 ctattccata gacagcaaga tgaagaacac ataacaatct aaccattgtc ctcttctcga 
85381 cactccaata tttgcatctt tccaatacta actgaagctg aaccatgtta ggaccaacaa 
85441 cggtaggaac ttctttcttt ccaaaactct gcgtgatcaa cattacaaat gtcctcttct 
85501 gaagctgaac catgtcctgt gatcgctatc acgaaattct gaaaaagaga gacgaatgag 
85561 ctgcacatca attagaggtg tcaactgggc gggctgggct gtccatggat tagaccggtc 
85621 caatttagtt taggtaggct atgaacgcta attggtctaa ttgtacaaaa gaccaattgg 
85681 tctatggtct aactgggcaa cgaccaaatg gacaatgggt' gtccataggc ctcctagaaa 
85741 tatagttttc caatttattg aaaaacacaa ttttaacgaa aaatcaaaac tttgcgattt 
85801 aatagaaaaa cataagttta cgattttagc ggaaaatgta attttacgtt ttggcgggaa 
85861 aacgtaattt tatagttttg gcgggaaaac gtaattttac ggttttagcg gaaaaacgta 
85921 cttttatggt tttggtggaa aaacgtaaat atacgatttt ggcgggaaaa aggtaatttt 
85981 accattttgg catgaaaaac ataatattcg gttttggcat agaacgtaat ttaatttttc 
86041 agttttggcg agaaaacatg atttttttat tttggcggag aaacttgata ttttaatttt 
86101 tggcgaaaac aaaataacta aattttggcg gaaaaacgtg atttctcaat tttggcggaa 
86161 aacatgattt atcggttttg gccgaaagca cgattttccg gttttggcgg gaaatcacga 
86221 tttccggttt tggcgggaaa tcacgatttt ccggttttag cggaaaaacg tgatttctcg 
86281 gttttcgcgg aaaaaaaatt cteggtttta gcggaaaaca tgatttctct gttttggtga 
86341 aaccgcgatt tctcggttta gacttacctt ttgtttgacc aaagctttgt gactctatga 
86401 agcatttgtg tttcttaatt gaattgtgag tcttgtgacc cctggtttgc atctcttcct 
86461 ttggttatat tgataaacag tgctattcct ccttttccaa tttttgattt taatcctgat 
86521 tcgtttttta ttaaaccata ttttatcttc cgatcattca ccattcactg ccactaacat 
86581 tctcgataat tcaagcttcg tttatgaaga taaattattt gaaactttct atctaattat 
86641 tttgcgttta tgatctattg ataatatgaa tctcatctaa tgt tatttaa .gagttatgtc 
86701 ttttcataaa ctgatattag ttttattttt tcaacataca actattaaat tttataaagt 
86761 agatataata tcttaataag agactctttt atttttaaga attgttcttt tccattataa 
86821 gagttgtatc tttcataaaa tgagttaaca ataataattt aaaacatgca cataaacttc 
86881 aatgtgtgtg tctttaccaa catgttaaaa cttttaagtt atgcggtatt tttttcatgt 
86941 aacaatcaaa tagtttatga gaatttattt ttattattct caattttatc tctgtttaat 
87001 gcaattttat tttattttac atgaaaacat gtatttgttt attcaaagat tttttctctt 
87061 gtgaagaaat acatccttta agaatactaa aaatgttgca aaatcgtatc cttccaaatt 
87121 tgtaacacat gtattttctt aaaaacaatt atattttttt taaatgttat tttataaaat 
87181 atagcatatt tatatttata aatgagttcc cgtgtgatat cgcacgggtt ccttacctag 
87241 ttaataataa taataagatt gcattaatac atagtaactg atggaaacca gtcgtagctt 
87301 atgtctttgt ggccaaggct atgtgttaat taatattatt actagatttt gatccgcgct 
87361 tttgaagcgc gggatatttt acgatgaaaa atttcactaa taatttaaca aatattttgg 
87421 taatttttaa agagtgtatt taaaatattt ttgcatttaa atcagtattt ttaaattcaa 
87481 cccgattgtg attataccgg ttaatccgga gatctgacaa ttcaatttat gtttttaaaa 
87541 tattcatatt aaaaaatcac taaaacccga gactaaccga ttgaactgat ggatgatcaa 
87601 tatgtaatct aattggattt aaattgtaat agtttcataa tttgtaatct tataatcgaa 
87661 attttaaagt tcactatttt gcaatttatg aaattatgac gtttctacaa aattttaaag 
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87721 agaaaatgat agatataaaa taattaagat taattatttt attatttgga aacattgata 
87781 gtagtataaa aatatattgt ttggaaacat tgataatagt ataaagaaat aagtatattg 
87841 tttggaaacg ttgatagtag tataaagaaa taagtatatt gtttggaaac atggatagta 
87901 gtataaaaaa gaaatattag -tgatttaatg tatgtttaac tataaagtat aaaagtatat 
87961 ttaatttaaa aacttacaaa ataaatgtta ggtccaacag aatgtttctg ttttaataag 
88021 atagattact ctataataac caatggaaat taaatgggta ctaaatcata taaaatataa 
88081 aagtgctgta tcaattttcc cttattgcct cattacttga tattttgttc gtgaatatta 
88141 tcgtttatag aaactatttc atttaaacta tgtagagtag ttattgtagg aggctaaaat 
88201 ctagcaattt tttcaaaaaa aaaaaaaaaa ttctagcaat tgccaaatct tttgttctat 
88261 ttgatttaca tatcttttgt tcagatgaat aatatatttt gcaaacccta tatatataat 
88321 cttattcatt aatcggtttc acaaacactc cccctttcta aagataaaat agtttttgag 
88381 ttgctaaaac aaatagttat ttattataat tctaataggc aattctcaaa aaaaaatata 
88441 gttcttgtta tacttaattt ttttttaatg tcaatagaag aataaaatac aaattttaac 
88501 gatatattta attccgcatc tatctaatct tatttatttc gtatatgtta gtatactaga 
88561 tgatgttgaa gttggtattt catatatgaa" agattatata tataattatg actttttctt 
88621 gggttcaccc^ cctaggatga acctttaggt tcaccaacca atagaaaatt gttattttaa 
88681 atctaatatc ttttaattaa gaaaacaaaa ataacctgcc aaattatatt atgcttttaa 
88741 aataaataaa aagattaaat aaataaaaat aacaatagtt ctcaataaag attattttaa 
88801 aaaatattta tttataagat ttggagttta gtgtttaaga tttatagttt agaatttatc 
88861 caaatgttta gtgtttttcc aagggttagg gtttacccaa aggtttaggg tttacccaag 
88 921 ggtttagggt ttaggattag agtttagggt ttagtatttt gttgagaaca tgtttagtgt 
88981 tttttccaaa ggtttatggt ttatccaagg gtttaaggtt tatgattaga gtttagggtt 
89041 tagtattaga gtttagggtt tagtgttttg ttgacaacat taattttttt taattcgttt 
89101 ttatatacta tttttattta tttttaaatt ttattttgaa aaaataatat aatttgacaa 
89161 attatttgtt ttcttaaata aaagatacta aatctaaaat aacaagtttc tattggttgg 
89221 tgcatagggg gtgaacccaa aaataactca ttaaatcaat ttatagcaaa tactaaccta 
89281 gtctaacaga tttccagtca aattgttttt aaggaaaccc ttacataata cgaaagatta 
89341 tacatataca aaagattata ccatccatat atagtaaatt ttaaattcgt tgaacaaatt 
89401 tatagtcaaa gttttgttat ggaaaccatt acattacgag aaatatttta tatataaaaa 
89461 gttaaaaaca aaatattata tcatccattt atagtaaata ataaaaactg ttgaccaata 
89521 tgcgttagaa tatagaaaat taattatgaa tacttataac aaaatatgtg ttaccaaaaa 
89581 tggaagatct atcatgaata tatataaaca caactagacc aaacgatcaa atcacaataa 
89641 gaagaaaata caaaaaggta aatcaagtaa gaaagtcttt tttttttcaa gttagaaatt 
89701 catatgcatg tcaaggttag aagttttctt ttgaagaaga ctaaatttat ttctttgaaa 
89761 ataagtattt aagcatctcc aaatactttt tagtatacaa agttttaatt tttttctaaa 
89821 tttatttatt catattgtaa aagcgtatat gttttagtat atgtaaaagt acgtaatcag 
89881 aatttgtttc ttaagagtta ttacactaga atctcgtctg atgttaattt tgaaatctgt 
89941 atttttttat ttctgtgaag gatacttgga caaagagttt ctgtaaccta cacatacata 
90001 tcatggcaac caagaaaagt ttatctatta tgttgtctct gttaatggtc ttgactctca 
90061 taagcctttt tcctacaatt tcaggtaata ttttttttaa ctttaggaag tatttatttt 
90121 atgattttat taacttaagg caaatagaaa atattctacc aattcgaaat tctctgaaaa 
90181 gaaatcaaga agtttttaga tgattcgaaa attcaaaaag actaaggagt tgagttaaaa 
90241 tatgttaatt tgctaatttt taatttgaaa gtaaaactaa tgctaattat attgacttaa 
90301 ataggtaacg aaggagaatg tcaacaacag gggctatgtg agggcgcaaa cccggaaaag 
90361 acctgcaaag caagatgtat ttctctgaat tacaatctcg ggggaatttg cttaaaacat 
90421 gcaaatggcc cagacaaacc aattacttat ttttgttgtt gcaagatata aattaataat 
90481 gtctgagact ataatatttt atcaatttat aaaattatta gtttacaaaa gttccctttt 
90541 tatttttatt ctaaattata tctttatata aaataagaaa gtaattaatt gaactgtata 
90601 tacattaatt aagccttaga aatttgaatt tcatattatg ttttatattg tttggtgtat 
90661 atgttgtata tgtttttaga tctaaatgtg gttttagata tagttttact aaatactatt 
90721 aaaatatatt gaattattaa gaaaaatatt tataatgcca tgatgaatat taaatagcat 
90781 tagaatgttt ggtttggaca tatccttgta tcaccacttt cttatattat gtgtcctata 
90841 tttactaaaa ctatgtcata ccatataaaa ttagaatgtt tggtttggac atatccttgt 
90901 atcaccactt tcttacattt cacaaactat gtcataccat ataaaattat tttgagacat 
90961 ttgacttagg cccctcttac attagttaat tatggatgat aggttttaaa attcattcag 
91021 aaattagagt taagatttat gataatgtat agaataagta atatatgaaa agaatgttta 
91081 ctaataaagg taacatatac tcggcctccc cactccatgc ggaagcagaa ggtgttattt 
91141 gggcaatgca ggagctgtta aaaccgaggg aaaagagaga ttcagctgcg atcagactgt 
91201 gagcaacttg tgaaacttat aaatactgat atggaatggc cagcattggc ctctgagttg 
91261 gatgaaattg cagcactgtc taaagagttt atgagcctct ctatatgcgc aatcccgaga 
91321 actcagaatg cccgagcgga cggcttggca aaagggggac gtacacgcaa gctcatcccg 
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91381 ttcgttagtg actccgcacc tacatggcta gcccctgaag ctagtctaac ggctgtagag 
91441 taatctttta tttttattcg atgtcaaaaa aaaaaaaaga tcttggtata aaatatatac 
91501 atattatgat tttatggact gatacaattg tagtgaccca agtgcagata taagagtaaa 
91561 tgagtacaac cttctacttg taaaatatta agagatagtt tgtgaaatgt aaagcttagg 
91621 accgccttct atttctcttt ggctgtcctt tatgggtact gacggtggat tataatggcg 
91681 atatggtctc tggcgagtat taagtatctt taagatcaca tttactgtcc gatcttcttt 
91741 atgttttgat ttgaccatgt ctgttcggca cagactatcc cctcatcttg cgatagacct 
91801 attttcttct tcttatcggg cctttgtaat aattgacccg gttcttggac cggtatggat 
91861 taaggtgaaa cctacctcca ataataatca tccagtcctc ggtcatcagc caggtcaggt 
91921 gtttattccg atttggatgg tatgtttgac gccgaaggtt tatgaaaaac ccagtaagag 
91981 attctggttt attgaattag ttgtgagatt tgattgatgt aaatcatggg atttgatttt 
92041 cccttcttag aaagtcaatc ttcttagctt ggcaattatt tccttttgtg gtttctgtct 
92101 cagaatataa cattcttgtc ttcttccttg gtttttcatt cttttaatca aaagaaaagg 
92161 aatggtcttt ttctctttct atatattttg tttctatttt gaataatgtt cattttcttt 
92221 tgtcattatt gfcagtatgat tgttcttcgc cttctattt-c tcaagtgttt tagctactat 
92281 tttgagttat tctctttgac tgtttatatc gaagtttatg tttctgtcga ggttcataat 
92341 cttgagattt tgttcggttg tttgattttg ttttttatcg tttgtttttc ttttagttta 
92401 tatagtgatg tttttgcctt cggcatcgtc ttttggtgag atttctctcc tttgcgtctc 
92461 tgatatcctt atacattttt ggtttgtttg ttctgttgat cttcttattt catcatatgg 
92521 tggccaaatg tcattatcct ctccaaccaa tttttaattt acttattatg tagcaattat 
92581 gtgacttgtg aatataattg gaaaaaacag tggttgacca tggtgtgaaa acattgtcgg 
92641 caaacaaaaa ggcagatttt aattattttt gtgtataaaa ttacaatgca aaaggtaatc 
92701 agatgcaaac gaatttaaaa accaacgttt aaagatttga ttatgtgtgg aggaagtttc 
92761 tagcaacaat tatgatttcc aggcgcataa ttatgataat tgcacacttg tgcatgggag 
92821 tgggaccttg gcaaaactta gcacaattat gataattgca cttgtcaata cataggaatt 
92881 taaatatcat tagtaatgag agttctacta agaaaaaaaa gtaatcaaac ctatactaaa 
92941 acgagaataa aatcaaaaga gaggtccacg ttgacaaaaa aaatcggcca ataggagtag 
93001 agcttcacgc cacgtcagat gtgtcattta agtcgttggg cttcgaattc cacaggcccg 
93061 acatcgttca aattcgagaa gcccgtgaag cccatttcac atctaagctc cctttcccct 
93121 tcctcttcgc gtcttgctcc tttcctatta atcgatgtga aacagagtga aaagatctaa 
93181 gaaacgtttg atgtagccct ggaaggaggc aaaaatttca tgctgatttt tgccgtgttc 
93241 gtcttctcac tgttaacttt ccagtaaata ttctctattc aatccttgct tcttttgttt 
93301 gacaactacg cttccagaga ccgtagacag gaagaggagg agcctctgtc ttcctcaatc 
93361 cgaactgcag ttccccattc gttttctccc tttcggattc cctcttcatt gcctcccgat 
93421 ctcataacca tctttcctaa agctcattca ttactgcatg aatcaaaagc caagcaacag 
93481 tttcaacttc cggtgctctc gattcttcca ttcatcgcac ttacatagac taaatgatcc 
93 541 ctgcaaccat ggcaaataaa acgtcctcat caaccagaaa ggcaagacgg cgtgcgttat 
93601 tataaaacaa tgttggccct cgatttcggc taatttcatt tctgggaatt acgtgagtat, 
93661 cagtcttagt gtctcttttc ctttcagtgt atcatatatt ttatgaacta ataacaagaa 
93721 tcatttgcag gcggagtaag acatagattc tgcaatgctc tcctctctgc cagagaagaa 
93781 attgcacaga ttgaagaagt taatgcagat gtgcagaaac aacatcctta aggtaatcta 
93841 aaaagggaac attcagtata tctctcttag tttct.tcatc aaattgaaca cgagacattg 
93901 gaggattgtt tgttagcgta tgttttcaga gagctccaaa tcctccatct acgaccctta 
93961 tcagttactt gatcaaactt gagatcacaa atacacgcat caactttacc gacaacgctt 
94021 tcaagttcca aaggttagtc^atcaacaaca tatccgtttg aacgttacta ttttcttatt 
94081 ttaacacacg gctgctctga aagccatccc agatatatcg gcgagaaaag aaaagaagtt 
94141 tgcagcaact gcctctatga aaactgattt agagaagaag aaaattgagc aatgtaagct 
94201 agaaaagtgg aggtactttt ctcaggattg tattatttga agcttggatt ttgaaacgga 
94261 gattatgctt cttgactttt gcaggaatca catacgagca aggaaatctt ttccacaaga 
94321 gcaggccgcc agagaaaaag ttgctgatcg agctatttat tttttaaatt gcacaacctt 
94381 ttgttttttc aaatattttg attacagtct tttcattatt tatgcttaac tgcttgagga 
94441 actaagctgc caagcaaggt acaaggttat ttgaatataa aagtccaaca gtgtaaggaa 
94501 gttgaaagtt gtttcttttt tgttcttttg tgctatcttt aaatcgatga caaaacaaet 
94561 gatttatcct gtttactata ggctggtgca tgtggaacgg tacaatagac tatggagaat 
94621 tcatgataga agcaatagtg cacataaaca gattcacgga gttcaattgc taacacagac 
94681 acaatgaaaa aaaaacgtac aagtaagact cctttccttt attgtttttt ttatcgttta 
94741 tgagtcgctt cgccagt ttt . gctctataac ttttttttac attcattcct ctttaatctg 
94801 tagctttgtt ctaaacttct aagtcgtggt ctctcttaac atcttgaact cttaattttg 
94861 tcttgagtca aagagagaga ttgattgctc ccatgtctca tctgttgaga agaaaaactg 
94921 tcatggctcc cgagaaatcc tctcagccgc cggacatgga gacaaagttg gatctagagc 
94981 cagagagtca aaggtaggag gttatcttga tccagcaaaa ttaatcaaaa agttcgtgga 
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95041 actttttttt tttattaaaa agatcgtgga 
95101 tcagtctcag ccagcacaac aatgaagtag 
95161 aggaaaccag cattcctcct gcgagctccc 
95221 actctggcca aaacaacaag ttccccgtta 
95281 attacaggat gcttttattg tgatttttat 
95341 tctgctatgt ttctagaaac aatattttaa 
95401 tttatatcat gttcattaag attctttgac 
95461 gacgtctgct acaaattgta taagctcaat 
95521 gtggagatct ttagacttcc tttttgttga 
95581 tttttaaggg cattcccatt atgttgggat 
95641 cactaaaaaa tgatactccc tctgtttcga 
95701 ttttaaaata agtgtcgttt tcggttttca 
95761 ctatttttct attggttgat atatggttag 
95821 aatatgtaaa actaaatgtt ttcttaatct 
95881 tgaaacgaag ggagtatact gtcaccaatg 
95941 ttttttttct tctcattatc ggtttagtta 
96001 taagtcctcc atttaaaaat ttaacacatt 
96061 tcatttaatg atgcatatta tcatacaaaa 
96121 attaaccaag cagaaaaaat agttactaat 
96181 aataaatatt ctcagaattt ttataaattt 
96241 gaaaaaattg ttaactaaat aagatccttt 
96301 tattatcttt acctttgtat cggttacaat 
96361 cttaatagca aacgtgaaag gacaaccgag 
96421 ctgcagattt tttcaactaa aaaaacaata 
96481 cctgagatac taacatgttc aggttcgacc 
96541 ttatcttagt cacctaacac ggatatatgc 
96601 agagatggtc tatccgttgg atgcacctcc 
96661 atctgggata tccctaggtt aataaaaaaa 
96721 caaacgatag ttgttgaaga ttaagtctaa 
96781 gtggctatca tattagactg taaaagacac 
96841 ttggagctct caaatagtgg atgcgctctg 
96901 attttgagct tatagaagat catgtaatac 
96961 aacttaaaaa aaaaaacaga gcacccaaca 
97021 atctgtgaac aattacaaca ctaacgacat 
97081 gaacgctgag atcccaccga ctattccaga 
97141 ctactctaaa tgtccactaa acacattagg 
97201 cctgaggcgc actatattgt tcaatacttc 
97261 catctacgcc aatcagccac taggaacaat 
97321 atgcttgctg aaggtcacta tcaaaccggt 
97381 aagaaacgat caacctccga tcattgctgt 
97441 cctgttccta tggagcagcg gtactatgta 
97501 tgcgatccca ataacatggc taaagtatgc 
97561 caatttgtcc actttgctac taataaagaa 
97621 gttaaccaca atgtattttc taaattcgtc 
97681 ttctattgca tatttaattt taataacttc 
97741 taaagtgaca acttaaaatt gtataaaaat 
97801 tccttaacaa cttaaaatta aaataacaag 
97861 aaataacaat gcaactaaga ctaaaattgt 
97921 tacgaacttt gacagttcaa tcaatatgtc 
97981 attttacaaa ttgttttact tcattttatc 
98041 aattttagct ctattttaaa aattgttttc 
98101 cataacaatt taaaaaataa atttcacact 
98161 cccatttaaa aatgtattat gcttcggctg 
98221 ttaaaaaaaa aatctaatat attctcatct 
98281 aaactttcac attgtacaat atcaacataa 
98341 cttaatatta ataaaagatt ctaaatctaa 
- 98401 taacgcggac aaagactcta gtatacatga 
98461 ctttatttat tttgttttaa ataaatccta 
98521 caaagttaat tgacagaggt tcgggc'tgca 
98581 gccgaaaaga ggtccaaata atataagcta 
98641 caacgcctga aaaaaagagg aaaatctaaa 



actttgcata caaactttga aagcaagatc 
gaattttacc acaaaaaaaa acaatgaagt 
atcctccaaa ggctaatgac caaactgttg 
taagcgaaaa acttatcaac aacgtcatcg 
tgttttgtcc tttagtttag ggttaaatta 
atcccattct ggatatcgtt tcggcataag 
gtcaaagttt aagcaaaaaa aagattcttt 
ctactatgta taacacccat tatctggaat 
cattggagta tagcagatca tatatgtatc 
cggaagccat aactgtgtta tgttcactga 
attatttgtc gttttagagt agaattttcg 
atgcaaaatt tattgacaat attctctgtt 
gtgtattgaa aatggtgttt ttattttgaa 
gtgtacataa acctagaacg acaagtaata 
ccgttcaaat gtcaaaacaa tctaattatg 
ttgtcataga tttttaatct aataactata 
cattaaattt taggtcatta attagtatag 
tgttttaatt ttggaaagtc aaaataaatt 
acaagaaaag gaaaatgata tattatttta 
attacatttc tgataatttc aaaattaaaa 
ttgtttggaa taatgctagc ataaaaagct 
gagcattata acattgcttg gacgattcaa 
atcaatgata gtatcgagag acaccggaga 
acaatcatgg tcgggtggta aggaggcgga 
catctgctac ggaaaattaa gttatagtaa 
ccatgtttta ggatccattt gaatatccga 
ttttggggat tagtctaggc ttctccatgg 
aaacaatatg tttttttggt aaaataataa 
aagcagaaac atcactaccc cagtctttta 
aaatggatct tgatgtatat atatcaaatt 
ttacttcttt tccgccattt tccgcttaga 
taaaacattt cctcaagaga ataagagtaa 
aaacagacca agtacaatgg gatgagaaaa 
tagtgatgtc tctgactacg atgattactc 
aacatcagtc tgcgatactt tgctgcgaaa 
ttcaaagaat tgtgtttaga gaacggcaat 
gtccacaaag aaaaacagac aggcctcttt 
ggaaataata tgcatcttta cgttttgtta 
aaaaaatatt tggataaact ccaatggaaa 
gaaagaatca agaactcact aagtgcaata 
aatatggtca accttaagcc acatacaaat 
aagcaatgtt actaattcaa aatgcttaac 
tgtttatttt aaaaaatttg tacaatattg 
cgtattaaaa taaaaattaa gtttagtagg 
aaaagaatga agtagtgctc tgatagaaat 
aaaaccatcg agtccttaac aactatcaac 
ttgtctaaaa aaaggaaaaa ataaaaatta 
gatgaatata taattagtat aaaattagta 
aagcacttca aaaacaaata aattgttttc 
tttataaagg aaaattttgt ccatttcaca 
ttagataaac ttatttacaa aagaaaaaaa 
tcaaccactt cctattccct ttactcatat 
aaataaatgc accgacttta ttttcatctt 
ttttcataaa tttaatcttt cccactgtga 
gaaaactcat ttaaacatac caaataaatt 
aaaaaaatat ctaaacctat tttccacgcg 
aatacaaagt aaacaagact ctaaacacac 
agaaattaag cttccactag cagtattcaa 
aggcccaata agtaacaaac atatgaagag 
aatccaaatc aatactgtcc agcgtgaaga 
ggaagcgtat aaagtgacgt gagaaatcac 
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98701 gaaatcacta cctatatata aactgaaacc aagagagaga gatccctctt ctcctccttg 

98761 tctccacaaa ccctaaagcg ctttatagtt tcagtttctc tcagaatcgg aatcgtttgc 

98821 ttcagtaagt tcctcttcca atcccttact tgttgatacc tgtttattcg taattaggtt 

98881 atgatctttt gattcggtta gatgatggat gattctctct tcgtaattag ggttcttatt 

98941 tttttcaaga tccgtgttct gcaagctata gatttatctg ttgacgtttg aaagtatcga 

99001 ttaatgcaat ttcgtctttg ttcgtaatca gtttgaagat cggaaaaaac aagaatgagg 

99061 attggtgttt ctgagccaga gcaatgcggt tgcgacacgt gtgtccagca tcgcacttta 

99121 tgcactcaag aaaccgaacc tagcaaagaa gtgactggct catcggttcc tgttagttca 

99181 gaaccagttc aacgtctcgg ttccacctca gatcagtgtt ccggaacaca tacgactcca 

99241 ctcgctcctc ctgaaccagc agcgcagtct gttgatgcat cctccacatc atcctccata 

99301 ttcagttctg ttagttccca accagcgcga gctctttgtc ccaccggttc actgccagtt 

99361 cctttatttg gttgttcatg gccacgtcct tgtagctgta ctggttgttc actgctaggt 

99421 ccttctattc gtcgttcatc tcccttcttt actgcttctt ccggatcatc aatctcctcg 

99481 tcaaggcaag ccaacgttac aaacagtttc ggttccgctg catccgaacc atctgtatct 

.99541 gggccaatga aagctcctat ttttacttct ggctcttcaa ccgcttccac atcctcaact 



99601 ctaccttcat tagttactcc ctcggacatt acaagaggat cagtgcaagc gcctgtccaa 

99661 gctaacactt ccaagactgc ttctgatttt catccaccta acgttgccaa cactggagtt 

99721 tgcgctgctt caaggactag cacgaacaat ccatttccag gatttagtgt tgattacttg 

99781 cccagatgtc cctctaacct ttctcgacca aacgcaccaa ctactacacc agttcctggc 

99841 cctagttcag ttttggctgg tggtgaaact gaacaaggta gtaggtatcc tcgttatgcg 

99901 cctacaccag atgttgacgg caagcagatt atttccatat ctgcttccaa ctcacatgga 

99961 cataaaagtc atgaagagtt gaggtgggaa gattacaaaa atggagacaa aggtaaacta 

100021 actcacttac tcaaatcttt aaatacataa cgtattatag cattttatct tttagatctc 

100081 tctgtaattg actaatctca tgttgcaggt ggggttggga ggtttcatcc ggctcacaaa 

100141 cctcgtaaag aatcgaggtg ggataaagaa aaaaatggag acaaaggtaa cttaactaac 

100201 tcactactca gatgtttaat agcattttct gttttagatc tctctctcta attgactact 

100261 ctcatgttgc agctggggtt gggtcgtttc ctcctcctga tcatacaccg tcggtgttta 

100321 ctcctccaag catacctgat cgtcctcgga tgagaactat tgatctaacg aaccgagaca 

100381 cgagtggttt tcctattggc tacaacaccc ccgctgcttt ccagagcccc catgaacccg 

100441 ttggtgtttc ttccccagca tcaggatgca cagcgtgtgg agccgcgagt agctcctctc 

100501 cttcgagtca cttgggcttg aacagtacca caaatcctcc atcatctgcg acatctcttc 

100561 ccgggctgtt cttttccacc tatggttctt ttcctttgct gtttgcaaca ccaaatcttg 

100621 cagcttatgg tacaactcca gcagtccaag cctatcctat gatgtttgga ataccaaatc 

100681 ttgctgctca aggtacagca actccatcag ttcaagcgta tcctatgata tttggaatac 

100741 caaatcttgc tgctcaaggt acaacagcaa ctccagcttt tcaagcgtat cctatgatat 

100801 ttggaatacc aaatgttgct gctcaaggta caacaacaac aactccagcg gctcaagcgt 

100861 atcctatgat gtttggaata ccaaatcttg ctgctcaagg tacaacaact ccagcggctc 

100921 agccctatcc tacgatgttt ggaacaccaa gtcttgctgc tcaaggtaca acaacagctc 

100981 cagcagttca gccctatcct acgatgtatg gtacaccaaa ttttgtagct caaggtatga 

101041 ctccagcagc tcaggcctat cccgttaatg gttcaagtct tctcccattt gccgccatga 

101101 gtctgcagta atcgtgcctc gctctgcatc aaagttgtgt cttcttttat cttgtagttc 

101161 taaggtcttt gttttgctta agagtcataa atcaagtctc gtccaacttt atgaattttt 

101221 ttttctaagt aaagattcag ctaatcatgg atgtaaacaa gtctcttaaa acatcattat 

101281 tttgctttta tcaataaaag ctcctttcct tctacj:gtaa tttttctctc ctctgttcta 

101341 ataaacaaaa aacaacaaaa agaatcatgt tctttgttat ctctcactaa gccatgcctc 

101401 aacatgaaca catgttgcaa aggcatactg agaaacttta caaaaacaaa ctgcaaaact 

101461 ctattctctg atctatccca accgcacaat caacacaata tccgagaaaa ataaaatttc 

101521 atgaacatat gttcttgtca acgcaagcta ctgtccaagt aacacttcca catcctttag 

101581 cttcggtggt tgtagggcat ttgttgcaag gattttggaa ctgaacaagg tagtatatgt 

101641 cctcgttatg cagctacact ggaaggacgt aacacatgta actgcttcca actctcatgg 

101701 acataaaagt taaggagagt atgaagtaga aagattacat aaaagggaaa caaaggtagc 

101761 tgactataac taaagattta acaacttaaa agaaaagaac tcttttcctt ctactatatg 

101821 ttctcttatg tcctatgatt ttgtgaaaca acaaagaaca cacaaaaatg tgtggcatga 

101881 atcattgcta cactctccat tcccaatcct ctgtttttgt gttttcagac atccaagtta 

101941 caaatgcgtt aaaattatga ttttctcttc aagtgaaatc atctgatgat ttcatatttt 

102001 acatattcct tctcatgtgg ggtttataga gggatatttt atgctgaatc tccccctctc 

102061 tttagtgcta attgttgaat cattagaaaa ctgcaactca cagatacgga gtaatgtcca 

102121 aaacaaacat agatgctgca ttaacattac aaatattaag ttcgtctact tcatttgata 

102181 cacggttaat cattttaata tctaaagata atacttcaaa cacggcgatt ttatttacta 

102241 cccaataaaa atgttagcgg agttaatgtt caccacaaat aaaaatttat tatcaatcct 

102301 atacgataac tatttgttta tctcctcttt ctcaagcata tatatacttt tatatgcaca 
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102361 tttacagtat acattattta aaattaaaac aagaggtaaa ctacaaaccg aacttatgtc 
102421 caacatgaaa agttttttca acttccatga atcaatcgca tcacacttag atactgtagc 
102481 attgacaccg gtagcaaata catcacatat cacatgcatt tcaaattata gaacaaacat 
102541 ataaaaaagt tagattgtgg ataaacaatc cgctcgtagg gcagctttga ttttagagag 
102601 atttcctagt gctattctta cagcttgcca tactaattaa tgttcccctt tgctactgat 
102661 agtatatagt agtatactaa atttctaaga aggttaaact ttcattatta ttcaggaggt 
102721 tcagcccgta aggcccataa ctaagaaaca catagagagc ccaagtccac tagacactta 
102781 atttagctat ttcaaatcaa tttaaaaaga agattataag aaacgtccgg cgtgaaatca 
102841 acgcctgaaa gaaagaaaga aaagaaaaaa aaatctggaa gaagcgtaca agcgacgtgg 
102901 ggaaatcacg aaatcattac ctatataaac tgaaaaagca gagagatccc ctcttcatct 
102961 catccctcta ccgagtctct ctacaaaccc taaagctcga tttcaagttt ctcagactcc 
103021 gaatcgttac caaagtaagt gccctttcta tctctcatct cttcctttaa tactttgcag 
103081 aatttgattt cttttccgcc aaaagttctg atctttgttg aatactggtt agccctttat 
103141 tcgtaattaa agttctgatc tttgatttgg gtttagatga tgatattacc tcttcgtaat 
103201 tagggttctg agtttttgtt tttttttttg* tttt tgataa tatctgcgtt gcctctgttg 
103261 caagtaatta gggttctgag tttttctttt taataagcta tcgattatct gttgatcgat 
103321 tttaatataa atttcgtctt tgtttggtat tagtttgaag atcgagaaac aagaatgagg 
103381 attgatgttt ctgagccaga gctatgcggt tgcgacactt gtgtccagca tcggactttc 
103441 attactcaag aaaccgagcc gagcaaagaa gtgattggct catcggttcc tgttagttcc 
103501 gaaccagttc aacctcttgg ttccacctca gatgagagtt caggaacaga gacgactcca 
103561 ctcgctcctc ctccagtcac cacaccggtt aataatcctg aaccagcagc gcagtctgtt 
103621 ggctcaacca tcccacctgc tgttacacca gttagttccg aacaaccagc acaagctctt 
103681 ggttccacct cggatcaaag ttccggtaca gagaccactc cactcgctcc tcctatcacc 
103741 acgtcggtta agtctgttga ctcgaccatc ttcttcaagt tcccaccggt acaagcacaa 
103801 gctcttgccc ctactgcttc cggttcaacg caagcccctg cttttggttt tggtgcattc 
103861 gctgctcgcg taccatctgc cacctccggt tgttcagcat ttagtttcgc ccctcctgtt 
103921 acatcggcac cagtgcaagc tctaggcaca accactacta ctactactac tacatccgcg 
103981 gccgctcctg catctccatt tcacagttcc tcaccaacca cattccaatt ccctcctgct 
104041 tttacatccc ttgctgcttc tacttttcct tctgttgcat catcaacttc atctccactt 
104101 gatgctcctc cctcaccatt tagatgggga tcactgcaag ctaacacttc cccacccttt 
104161 agcttcttgc cagcgcaagg ttctgacaag actggttctg cttttactcc accgtttggc 
104221 taccctggtg gttttgccag acctgatgtt ggtgtctctc atccagggtt tggtccctct 
104281 aaccattttg gaccaaacgc accaactact acacctgttc ctgttcgcag tccatttttg 
104341 gctggtggtg gaactgaaca aggtagtagg tatcctcgtt attcacctac accagatgtt 
104401 gacggcaggc tgataatgtc catatctgct tccaactcac atggacataa aagtcatgaa 
104461 gaattgaggt gggaagatta caaaaatgga gacaaaggta aactactcac ttactcaaat 
104521 ctttagtaca taatgtatag ctttttatgt tttagatctc tctgtaatta actaatttca 
104581 tgttgcaggt gggtttgggt ggtttcctcc tgttcataca tctccctttt cctcaccaac 
104641 ggtatcaccg tcgctatttg ctcctccaag catacctaat cgtcctcaga tgagaactat 
104701 tgatctaacg aaccgagaca tgtgtggttt tcctattggc tacaacaccc ccgctgcttt 
104761 ccagagaccc cctgaacccg ctggtgtttc ttccccagca tctggatgca cagcgtgtgg 
104821 agccacgagt aggtcctctc cttctagtca cttgggcttg aacaatacca caaatcctcc 
104881 atcagctgcg acatctcttc ccgggatgtt cttttctacc tatggttctt gtcctttgct 
104941 gtttggctca ccaaatcttg caacttatgg tacaacagca attccagcag tccaagccta 
105001 tgctattatg tttggggctc caaattttac ttctcaaggt acaacggcaa ctccagcttt 
105061 tcaagccttt cctattatgt ttgggactcc aaatcttgct gctcaaggta ctacaagagc 
105121 tccagctgtt caagcctatc ctacgatgtt tggcacgcca aatattggag ttcaaggttc 
105181 aactccagca gctcaaacct atcctttgat gtttggcacc ccaaatcttg ctgctcaagg 
105241 tacaacaaat attggagctc gaggtacaac tccagcagct caagcctatc cgttgatgtt 
105301 tggcacccca aatcttgctg ctcaaggtac aacaactcca gcagttcagt cctatcctac 
105361 gatgtttgga acaccaaatc tagctggtca aagtacaaca acaactcgag caggtcagcc 
105421 atatcctacg acgtttgctg ttcctcaagc tgcgacagct ccagcagttc agccgtatgc 
105481 tatgatgttt ggtacaccaa gtctcggagc tcaagatatc actccaggag gtcaagccta 
105541 tcccgctcat ggtttaactc tcccattcgc cgccatgagt ctgcagtaat tgctcctcgc 
105601 tctgcatcaa actcgtgtct tttatcttgt agttctaagg tctttgtttt ttccttaaag 
105661 agtcataaac caagcgtcgt ccaactttgt gattttcagc tatccatgga tgtaaacaag 
105721 tctcgtgtct tttatctttt agttctaata tctatctttt taaaacacaa acattatgtt 
105781 ggacctaaca tttattttgt aagtttttaa attaaataca catttatact ttatacttaa 
105841 acatatatta agttactaat atttctttct ttatactact atccatgttt ccaaacaata 
105901 tatttatttc tttatactac tatcaatgtt tccaaacaat atatttttat actactatca 
105961 atgtttccaa ataatacaat aattaatctt agttatttta tatctatcat tttct-cttta 
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106021 aaattttgta aaaacgtcat aatttcataa attacaaaat agtgaacttt aaaatttcga 
106081 ttataaaatt acaaattatg aaactattac aatttaaatc caattagatt acatatcggt 
106141 catccatcag ttcaatcggt tagtctcggg ttttagtgat tttttaatat gaatatttta 
106201 aaaacataaa ttgaattgtc agatctccgg attaaccggt ataatcacaa tcgggttgaa 
106261 tttaaaaata ctgatttaaa tgcaaaaata ttttaaatac acactcttta aaaataacca 
106321 aaatatttgt taaattatta gtgaaatttt tcatcgtaaa atataccgcg cttcaaaagc 
106381 gcgggtcaaa atctagtttc tcgttaaact cgctgctcaa aataaatttt catgaaccta 
106441 ttttcttgtc aacgcaagca actgtccaag ctatttttat attacatgtt ctttctctgg 
106501 gttgttacaa cactatcatt ttgatattct ttttgaatga atagaaaaac caaaggtggg 
106561 aagtttacaa ctttgtcaac tctataattg tgttattaag gattccattc cgctaacaat 
106621 tttagtttgc ataatagttt tttctctgaa ctggtacaac accagtaaac aaatgattgt 
106681 cttctaaatc gcaggactaa gtctctgttc tcctacattg actctaatta aacaaatgtg 
106741 tcaacttgat gatttttact tggtaaagac atctaccaat cattgatgaa aacaataaca 
106801 tttcatgcac attattcctc tgaattagta caacatcttt ctttagtaat tcttttgaat 
1068~61 aatattcaac aagaagaaaa ctccttttct ttctactata aagtctctat cattctctca" 
106921 tgcagaacgc acacacaaaa taaaatggca agaatcataa- ctacactctc* tacacctctg 
106981 tttttcttct tcctcttctc cctgctctct caccaaacca tttctcaacc cgaacacttg 
107041 actactttct gcaacccttc cgacaacttc acacaaacca gttcatacga agcaaaccga 
107101 gaccttctac tctcctctct ccgcgtcagt tcctccctcg gaacctattc aaacgccaca 
107161 gtcggtcgta gtcccaacac agtccacggc atgttcctct gcagaggaga caccaccgca 
107221 gcgtcttgct cagactgcgt ccagaccgct acaatcgaga tcgctacaaa ctgtactctt 
107281 aacaaagaag cggtcatata ctacgaagag tgcatggttc ggtactctaa tgtttccttc 
107341 ttctctgttc ttgaggtcag accgagcatc gtcctttact ctcttcgctc tgctccaaac 
107401 tcgaatacgc tcaatgaaac gttagctgat aaattcaacc aactgattct caacgtgtct 
107461 tcgtcctctt tggttccgta ttttctggaa gatcaagaac ttgtgactca agcagagggt 
107521 tcttataagt ttgagtcaat ggttcagtgt agtcctggtc ttgaccggtt caactgtacc 
107581 gtttgtctca gatttgcgct cttaagagtt tcaacttgtt gcggttcacc aagttctgct 
107641 ctgatcttta ctcctaaatg tcttttgagg tatcaaacct ctgttttgtc gtcgccgcca 
107701 ccgttgcctc catcctctcc acctcctcta tcgttgccgc cgcctccacc gtcaccggct 
107761 ctatttttac cgcctcctgg tttatcgcaa ccgccaccac cgccgctggt tttcacgcgg 
107821 ccacagagca gcgggtcgtt ttctaacgtt attaaaggtg atcatatttt tgtatgtatt 
107881 tttgtttagt tttgatcata tgactttctt tttcaaaaaa tttgatcata tgagtatctt 
107941 tttgatataa tgtttatgta tccaactaca aatattattg acagggttgg ctctaatttt 
108001 attccatttt ttttgtggca ggaaaccaaa tatttgggag aattgctatg acgatggcag 
108061 ttttggtgtt tgcacttgtg aatttgtgaa cacatttgat gattttggat tgaaataatt 
108121 tgcaattcct tttatgtttg tcatgtgttg ttgttgtctt cttaagatca aaagccacaa 
108181 agtatgtgaa tattgaaaat catgtttctt ttttgaacat ttaaccatgt tgattttgaa 
108241 catctaatca ttttgatgtt tgtttgcatt tgatcaacaa aagaagaaga gttggaggct 
108301 gatatatcaa attataatta atttctgttt tacatactct tttgctatat caatactttt 
108361 aattttccag catgaccaat tgataactgt ttagtccatt ■ tat ttataat catgttttcc 
108421 taataattaa ctgccaatat ttaacatata gatatctgca aagacatatt agattataac 
108481 tgctactaca ccactacaca ttttaagatt atcacgaaaa caacttctgt ttttgaatgt 
108541 ttaaaaaaat caaacacacg cacaaatata ttaatatgct atacaaattc ttatataaaa 
10&6.01 ttatggtttt atggtatgtg ttatacttta gcataactgt atgtatataa atatatatgt 
108661 atattattta tagtttgtat tgcaatattt ttaataaact aaacaattat tatatttaaa 
108721 atttaaaata aatcttcaat ttgtattcat aaatttatga ttagaaagtt aataataaaa 
108781 tagtaataat taatttcaga catatttaca gaaagtttaa aaaaatcaaa cacacgcaga 
108841 aatatattaa tatgctatac aaattcttat ataaaattat ggttttatgg tatgtgttat 
108901 actttagcat aactgtatgt atataaatat atatgtatat tatttatagt ttgtattgca 
108961 atatttttaa taaactaaac aattattata tttaaaattt aaaataaatc ttcaatttgt 
109021 attcataaat ttatgattag aaagttaata ataaaatagt aataattaat ttcagacata 
109081 tttacagaaa aaatacaaaa caacactata tatgcgcttc aagtacagat caagatttaa 
109141 tataggtcta acattttacc ggacccgaaa aaccaaacta gaacggatct gaaaatacag 
109201 gttcagattc gagtttatgc taaaatattt attggatttt ttttatggag atccacaagt 
109261 ctcggtttgg atctacgtcc tacaagagac ccaatcaggt attcgaaata ctcaaaaatt 
109321 attatatatt aggtaaattt ggatgattgt gtgtatttta gatatttcag atattttttt 
109381 aaatttcgga ttcaggtttt taggtataat ttcaggtttc gggtaaattt tagatttcca 
109441 aaaaatataa tttgggtgtt cgggtaaaat ttgagatacc tttcgagttt tcggatctga 
109501 ttttgtgtaa gtttcgatat attttctcca tgtattttta ggatttgcag gtactttttg 
109561 agttttcaaa ttcagtttgg atatgtcagg tttttttcgg gtcctatata cccaaatcaa 
109621 cccgaatccg aaataaaact gaaattatga atgttttacg tatattagaa ttatatactt 
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109681 gaaccgacaa gagttcgaca agctcctggc cgagcagacc taaaaaacta tagatatcta 
109741 tatggatcta gatcttcagg aaccgaaaga tgcggaccca ataagacttg atccgaggac 
109801 cggatgccta ggcctaatct gaatagataa atatgattac aaaaaataga tttatagtta 
109861 taaagaaacg agtatatgcc ^ggccaacttt aaattactat tatttctaca gtaaatcaaa 
109921 taggttttga ttaagattta tacacatata tgactacaac aaaatatgat taagaaaaaa 
109981 acacaagtat tacaattaaa tatctagaaa aagttaaaca aaaaatatac atgtcttttt 
110041 gaaagggcgg gtcagaattt agattctgaa aacccgaatg ttctcaaata tttaaaaaaa 
110101 tagtttcatt taaaataaaa aatcatataa actttttaaa atacataaaa acagatttta 
110161 tcaatacata acgggtctat gacaaaaaaa acagtttaaa aatattaaac tttttcttta 
110221 aattaaaata ctaaaaataa aatcactgtg aaataattca caagtttaac agcaggttaa 
110281 taaacaaaaa tcataaaata cgaatcagaa ttatttggag tcttcaaaat ttttgtcata 
110341 tttaaaataa caaaaattat tcatacaata attttagata aatttcataa aattcttaac 
110401 aaaatataat ttatcaagta taaaaatatt catgacataa aattaatatt ttcaacaaaa 
110461 atatacctgc agaatctagt gtactaatta aatctgcata attaccctta gtgtctattt 
- 110521 ttttaaaaga aatcagcatg caacaaaaat tcgttggtag atggactgat tcgcaacgaa 
110581 tacatcacaa aatcttagtt tgcagtttta cget-aagcaa ttatcaaact aaaacatcta 
110641 atctagtggt ctcatgcata aatgcatact gttaaatatt gaatgacggt ttaccaaata 
110701 aaaagttatg cggtcgtcga tatcatactg atatttcaag ttaagataac tgtttttttt 
110761 aaagcacact tgtttttagg gtttatcaac ttatcattct gtaagcgcaa tatgtacata 
110821 tattagcata ctcggtcaat gttatatgtt ttcattgtta aatgcagtct tctccttcct 
110881 tcgtttgggc atagtattta acttttatta gtattatagt ttatcctagt gaacccaatt 
110941 aaagtaagtt tgatagatgt caaccctttt aagttgctta aactaatcaa cacttgaaag 
111001 ttcgtcacga aataatccaa tacctatctt ttgagtgttg cagccatgct . agtttggctc 
111061 agatcatctt gtcatgtata atctccgctt tgcaaccgag tctgggcgtt ataaatggtt 
111121 agtagacaaa ttcttttatg caatgcaata tatatatgcg gatactgata tattgactct 
111181 gcactggctc taaattttag gttacatacc ttatccggaa tcatttgata tacaacttat 
111241 cgctttgtac taccattttg gattttgtga ctggctgacc cgtttctgtt agtcttgcct 
111301 caccctgtat gtgagatgtg gaataagcaa cttattaaac ccaaacacta 'atattttggc 
111361 ttattttata ttgcaggaat cgaaatatat tggagaattt tcattacagc catggcaact 
111421 tgggtgttgg cacttgtggg tctatgatat aaagttttat ttatatttgt attccacaat 
111481 tgtatgtgcc gctattttct tgtatagtta ataagaagca acatacttgt tttctaagtt 
111541 atctcaaaaa agtttgacgt tcgatatgtt tactaaataa aatcctaatg catcaccata 
111601 taccttcttt actgtaagaa tgaatcgtcc aaaaacaact ctgtgtttca gacaacaaat 
111661 gtgtcaactt catgttttta ttagcttcat atggtttaaa ataatagcca gccggaaggc 
111721 tgcttaacgg aaccaaaagc tattatgaca gaaaaaaaca ggtgaaagag gagttaacat 
111781 cttgcacaag tgattaacga agtcttattt gcttgagaga tcaaggattg aaaccttgat 
111841 tttgcaaatt tattgctctt caatataaca tcaaaataac agaaaatctt gccattacta 
111901 tgttttcaaa ggcttacata caaaagagaa aaaaaaagct tatcaaatga gaaacgttaa 
111961 ggaatatggc acagattaaa cgttcccctt agtgtgagct caatctattt gcgtacgtgg 
112021 caagtatttc aaggaacact gtttttggtt atggcttttc aaaacaattg ttgcttaggt 
112081 ctagggctta ccattgccaa ttttatgtgg aaatgctgaa gataaaacca aagcgtaaat 
112141 tggcgtaacc acttctttta ctgctgtatc tgcctgctac aatgaatgtt tgatgttcat 
112201 agaaaattaa gggaaacaga ggagggacgc tctaaaaagt tcctactaaa ttaacatgta 
112261 atccaacctt caaaacacac ttgtctttat tctttacatt gttaaagcaa tcaatacatt 
112321 ttcgtttaat tctccttgga aatggttggc ttctccatga acttggagac aaccatacca 
1123 81 atagcaaaga cgagaggcca cactgatact ttgacaggac cgtctcgtcc cctgtcagta 
112441 aagctacaat tgatcgtgga cactattaac tgaccaacac cgcagaaaaa caagatagca 
112501 tccgtggaac ttccaaagat tctgttgtag ccacgtggtg atggatagta gaaccaagga 
112561 aaaggcaact tgcaacggaa acgagcttta tgggctcggg ttttgtaaga gaagtcaagg 
112621 atagatagga taagagaaag agttgacatg actaacgaga tcaattgaag gaagacagac 
112681 aaacaaacaa gtttaggagg tgttccccgg ctatgatgcc cacataggtt aaagcttgtc 
112741 tccatctttg ctaccacgac atgtttcctt gaaagctttc ccgaaatctc cagtctgttt 
112801 cctaacatcg gatggatcta ctccgtagaa aactgtcgtc acgatttgtt ccttgcactt 
112861 caaaatctcc agcaggtttt tcaaacacca gctggaagaa gcataattct tggaaagcac 
112921 cacgatcgat atcactgatc ccctaatagc ttggcctctc tcagtccctt gatcatcgaa 
112981 cattaaaatc cccttgctgt ttaactgtct ccgcaaatga ctgataaaag tcttagggac 
113041 gtcgggtccg tagaaactcg caaagagttg gtagctccaa gtgcgagtcc taaaagaaga 
113101 agaagaagaa gccatgggat aaagtattag ctagggagag agggttacaa gatcagatat 
113161 acccaaaaaa aaagtctggg gataagccat atatgatcca gaaagcgcca gtgttcactg 
113221 tttaataaaa gaccgaacaa gggttttttg gcattttcag tagccaacaa acttctttgc 
113281 tctttctcaa aagtgtacaa gacttccttg cacttcaagt ctctccaaca gctctatata 
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113341 agctccatat ccatgaagca ttgttctttg agagcacaaa cgatcaaata tcttcgacta 

113401 gtttttagga gcagagaaga gcgggcatct tcgctttagt tccataatag aaacatcaaa 

113461 atcttataga aaacctcaaa agcttacgtg gacggcatta agttaacact tggtattttt 

113521 agttaattag gttttacata gaagaagaac agacatcgta tgctttttag ttacacaatt 

113581 agctactaaa caccaacagc tcagattcat tctcaaatct tcctcccccc cccccgtcct 

113641 gatgaacctt gagtgagtct tcaccgcctt ctgttgagtg gtttcattga agttgaagtg 

113701 ctaatcgtcg gtcttttagc acctgaggcc aatcaacaaa aggttagatg actcacgcag 

113761 caagattgac actaaagacc atataatgag atttccattg tgatgcaaac taaagaagac 

113821 atccatatgc atagtttaca caaaagatca tagaatgtaa ctatgcataa cgatttggtt 

113881 tttaggatac aagttactat gcaaaaaaga gcttatgtta cttgttaagg tcttcgagga 

113 941 ggagctagtg gcaattgaca gacgagaagg tctggtggtt tggcctggtg gtttcaagat 

114001 ctgcttcgga atcttcgatg gaatactagt tctgctttta aaaataccaa aacaaaacaa 

114061 acataagaat ctaaaaaaat atgtattcat attgttctgt taaagtgtgc tagtagaggg 

114121 tacttacttg tctttgggca cggttttcac catctttgct gatttctcga gcttctcagg 

114181 gaccaagtgt ggttcaggtt cttccgttat ttcaagaggc ttcacggttt cagggaaaag 

114241 agtcaactcg actgcactgc atatggagcc atcagtttct gttcccatcg aaagacagct 

114301 atcagagata tcactcaatc tgtcctcaga atctgcatcc ccaaggccaa tgagttcaat 

114361 gtcatcttca atgtcaattt cttttgaccc accggcaaac ttagatagct ggtgaagatc 

114421 ctttcgatgt ttaagttcgt ccgatgaacg cggtgaaccg gaatcagaat gcttgctact 

114481 gtattctgag cagttgtcaa aatctgaggc tgctctccca aaagaaccag gtgcctttcc 

114541 tcgtcgagca tttggtgaag gtcctataga gtgtcttcta ggtgatgaag gccccaacaa 

114601 tcttagattg cttaaaccac gtttcggcac ggttgtactg ttactttttt gcttctgaac 

114661 attttgaagc tcttcatctt tcttggcaat aacatccttc aagcttgata cctgttacat 

114721 caaatggtat caagataagt ataactttgt tctctattta cttccaatgg gcaagattaa 

114781 attttgagag agatagaaag aaaataagca cctgttccat gaggtgtcta acatctcgtc 

114841 cctctttatt acttttagct gcacctaatt ccacaccaga aactctttca gcgaacttca 

114901 gagtgctaac cgtctcagca taagaatctc catcaggatt gacttgaaca aacataagag 

114961 tcttagcctg tcctcctaca agacaaaaca aaaaagtcaa gacaaagggt taaggaacca 

115021 tcaacataaa gccaacgggt atatcatgca atacgtgtag cccaaatctt aaaacttttt 

115081 cttcaaagag ggttaagaaa gtacccaaag aactttgaag gacttgcgtc aatttgctgt 

115141 ttctatacgg cacatggggg ttcttatgcg ctagagcaaa tatgacatct ccaagggctg 

115201 acagtgattt atttatatgc tgagcctcct tgagcctctc tccagttacc tccgagcgat 

115261 cgaccctctc acttccagca agatcgacca agtgcaaact accacgcaaa acagaatcgg 

115321 tttccacgtc gacaccacgt acatgaacag aaagaacgct gtatacaata ggcaacaatg 

115381 tcatttctat aatcctgagg aagcgaggct tagataatgt cagaagataa aaaaatttac 

115441 aaaccagtgt gatctactac tcctttcatt gagagctgtg gcaccaacgg ttctgttcat 

115501 gagcccaata ttcatcagct caagcacatc ttcagttgat ctcacagaat gcatgcttgc 

115561 atctgggaca gctaacccat ttggtaaggc agtattccaa acccctaatg tacgcaagtt 

115621 aaggaaattt tctatggaag cagcatatgc aaacatatag ataaatatgt aagcatatgt 

115681 gaatgtacat attccaagta aaatttagca acaatattag cttagagaag gatatcttcg 

115741 attagaacca ccatcagaaa gtatgtcacg aacttgctca ttgtatatct caaccatttg 

115801 aacacctact tcatacacaa cagtgttttg tctaagctga gttaagtgaa acaagtcatt 

115861 cagagctctg taattgacac cccagtgttc ttttgaagtg atgcttggcc cactctgtgt 

115921 tagcaaaatg gaaatatgtg cgttagagag taataatgtt tggctgaaac ataaaaggag 

115981 caaagtcaag catggcgttt accattgtat aagtttttcc agatcccgtc tgaccatacg 

116041 caaatataca cacattataa ccatcaagaa ttgatcgaat taatggtcga gtatctagga 

116101 aaacctcctc tgcattaaaa aaaaacagat ttaggagaaa ttaatattac ataaagtcct 

116161 taacaagcta atctatatac ataagaggaa ggaggtttag accttgagtt gctgcttgac 

116221 cgaaaacttt attgaactta aacaaccgat gggtatcttt cccttgctta aacggatttg 

116281 caaccaccaa ttcaccgttc tcaccaatgt actctataga agtctgtcca ctgttttgcc 

116341 ccggaaggaa tggtcttatc cggcaataga ctctgatatt tcctggaaaa taaatgcaag 

116401 agaagaacag agtcatatct ttaatggtcg agtgcaaacc caaagtacac taacccaaag 

116461 tgttttacct ttcaattcct gcacttcatt gtacagtctt cggttttctt caaggaccac 

116521 atggtaattt tttgctgcat cagccactcc tttgagcttt aaaccttgga aaaatattac 

116581 aaatatgaag ccatgtattt cagactagaa aaactgacca aacagccata caaaattggt 

116641 gacataaatg atacttacca tagtaattta ggtcctcaaa gtatttcttt tgtgtcctta 

116701 agacttcttg ctttatagac actgaagtag cattcaaagc cttcatgatt cacaaaaact 

116761 atgtcattta aaagaactac cagggtattt attgtttaga atactatatt ttaaagccaa 

116821 acctgcaaag caccaaactg gttgtctatg aagctttggt aggtggactc tctgttctcc 

116881 catttttgag atttagatag gtacaacttc tcgagttctt tgaccttcct agttgattcg 

116941 gcaacttgta actctgcatc ctttaattta ctctccaact caactgtact gttttgtgct 
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117001 tttgcttcta actccaagca ctggtttgca tgcgcctctt tcaccacctt gagttcttgc 
117061 ttcagcttac gaatctcagc atcactgcgc tctttttcct ttttcaaacg caccacatct 
117121 ttttcttctg acttttctct ttcttctatt ctgctttttt caagctaaag agcacaacaa 
117181 acccattatt gggaaaactc tgagctaaaa taatttaata cacaacacaa gcggaatgaa 
117241 aatcgatgca tgagtcacgt atttaagtaa aaggggcata cacaacgact tatgatataa 
117301 tttcaacatc tagaggattg gcttaaacgt caattttcac aagtactatg caggttaagt 
117361 atgtattaag gagacgatac atattgaatc aataacatac ctttgtacgc tccatacaat 
117421 tcgaaacaat ctacaacaaa gaaaataaca gaaagaccaa tcaataagcg aaatagaaat 
117481 aaactgaaat cagagacaac agtacctcgt tttcctgagt tgccccagct gccaaggttt 
117541 ctaagacctt tattcttgac ctgtatttat cttcacgtac cctgaagagt atattttgct 
117601 gcacatatca taaacatgtt tcattacttg gtcagaagga ttatattaac acgtgcatac 
117661 aaagaaaaaa agagtattat gtactcttac gtttttcagg ttatcagctt gatttgagat 
117721 tctctgttct atcacttgca caagtgcgct caagagtgat gccattgcct gaacaacaaa 
117781 ccgcaaaagg gagaaacatg tggatcataa gaagaaatgt tccatgaact aaggattcaa_ 
117841 attttaggca aaccagatta aagttatcaa-gagaaggaag gcatcagtag ctaaatcttt 
i-1-7901 gtcttacatg agacacatgt ccattcatct tcttcacgct ctcatctagafagtttatcca 
117961 gcatgtcaaa tagtgtccga gtaggagcat tctacaaaca aagggataaa gggataaaag 
118021 ctaaacggtt gtacacagtc tactaaaaat aagttagaat actttactcg taaactgtca 
118081 gatttcagta attctgaaat tttagcatct gatgtatcaa tctcaaatcc ttccttcgac 
118141 tggaatccat caataaagtt gcggtcatct cctctggaat ccgaatggtc ttctggcaag 
118201 ctccatctcc tcctcgcacc tagtgagttt ttatcattac caccatcgga aaaacttgct 
118261 ttaagggcct taagggactg aaaaactggc agcatatccc cctgcatttt tactttctat 
118321 taggaggcat aactccataa agaaagacga gagacggcag agagaagaaa atatatgggg 
118381 gcaaaaatca tggaatctta tcaagaatag caaatcagac aatagccatg atgagaacag 
118441 cgttgcagcc agatgatgtc aaaaacaagg cttgttttaa ctgttaacaa aaattatcta 
118501 ccctgaattg gttcggtttt ataaaagcca aggtggtaaa cccacaatct ctttacctat 
118561 gtgatgcaat aatactatat aaaccaacca accagatgtt ctctaatttt caaatcattt 
118621 tttctacaga aacgaaaagt tatatccata gaacaaaaca aacctgttct atgtctgaaa 
118681 cctcgaatct gggcagggcc atttcatcca tagcagtcag aaaccgctca actctaacat 
118741 aagcaggctc aaagctgcct ccctatagaa tagccaaaga gtgtgagtct tagcagaaaa 
118801 ataaaaaata gagaaaaata aaaacagaag cttagcacca aaaaaagata tataagccaa 
118861 atggtcaaca aaataaaaat tcataagtat accattctca ttgaaccagg actaagctga 
118921 ttcaaaaggt tacacaagac agttccatcc ctcaagcatg ctctcacttc gtcctccgaa 
118981 gcttccgatg gtaaattcaa ataaggaagt gtctgattta accattccac caaattctga 
119041 tgacctgagg gcaacaaaat gaagaagaaa tatacaagaa cggtgagaac agaagccatc 
119101 ttccattcat gtctacacct aaaatgagct taaa'cttcgc cattttctta taatcacatg 
119161 aataatctct gtcattactc aaagttctga gagataaata caagaaaaag ggaatagatg 
119221 acgacattta cctttcttac taccatccaa gcttgaacga ctttcagacc cgtcgaagct 
119281 gagaaaacta gttacatcgg agggagcata gacgccactc atattggcgt cctgttcgag 
119341 attaatgttc attccaactc agagatcacg ctcagttccc atgattgtag ttgagaggag 
119401 gagaccctaa agcaactaag aaagttttgt caataaaaac aacacaaacg tactaattcg 
119461 ccgtgtaaaa tattgaaaaa ggggctccct tcatcaatct aaccttctaa tataatcaaa 
119521 ctaagacgtt tcaatttcag ataagaccgt ttaaaattat aatcaatgat gatgaaagtt 
119581 aattcccctt gtgttaacca gaatgcggac caaatcaatt acgcagatcc aagccatgaa 
119641 aagagaatta aaggatacga acctgagaaa agctaataga taaagctctg gagggtggaa 
119701 gcaacgcaaa agaaaaaaaa atggcgactg gtactgaagg aaaatgctga gattcacggc 
119761 gaggaagggg agagaaggcg tttgaatctc ctcgccggga gagaaatttc gaatttttcc 
119821 agccgcagcg attttgctca ctcttttcct cctttctttt ctcgtctctt cctttttttt 
119881 tagattccga gaaaaaacaa atgctaatta cggttccaac ttcgaaaata gagaaaaatg 
119941 ttttttcttt tccttagaat taattttaaa aaaaaaacat ttacaataca atttatgttt 
120001 gattttgctt tacgttatag atgttttggt gggccctacc cttttttcag aaaaggctct 
120061 tcccgctttc ctcactccaa aatttttata taaatttttt acataaatat ccgattgatt 
120121 atgcggccca attaagccca cgtataagcc cgtttaactt aaaatgtcct tcaacttgtt 
120181 gatcattcta cctaaatttt catgtctgtt ctttatattt gtttactagg ataatatttt 
120241 tggtgtgtat gatgcttttt tatttgattt atgaggagga actgtagtta gttataaaaa 
120301 tgggttaacc agctctagag tttttctttg ccaaaaaaaa aaagctgaaa attatgaatt 
120361 atgaatctgc ttagcttcga tcaaaccaac gaggcatagt tatgacatgg gctttaaccg 
120421 tgctttatat attcactgat gagaagaagc aaccctcaat atgccaaacg tgtctgttgc 
120481 ggagaggaag tctcagctta tgtgatccat attgataatt tgatatggcc caaacatttg 
120541 attaaaggac atacagaggt accatatata gtatcatttt cttaatactt ttatgttttc 
120601 atgcatttgg gggaagggtg tgataattgt atgaaacaac cttaaccaca gttcacgaaa 
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120661 gagttaaagc ctacgttaga gaaaatatct aagagagacc ttcgttatct tttgttatag 
120721 tactttgtaa catacataaa acatagtaca ctgtacgtat tattctctat atcaaattca 
120781 tcttgcactt gcactttcaa ttgatatcag agagaatcac tgaaatcact ggtaagatct 
120841 tgaagacaaa atagatgatg tagtacagta ggaagcatgg tcattgaaat tgttagtgca 
120901 ccaaaaacag aattttagaa gttaaaataa aaaaattact caaggtgttc aaataccttt 
120961 ttttttgtgc acttttcatt taccagtctt aaaatactac ataaattagt acaagaagta 
121021 tactacagga agcgaagatg tttaattgac ttgaagacta gaattggagc cggcctaaag 
121081 ttgaactaga agttacttag agaccataca atcaaagatt tactttggag aaaaagaaac 
121141 tcatcaatgg aaatttttca aaaggctttg aataagattt ggaaaaacta taaaggagat 
121201 ttgtggaaat ttatcctttg gatagttagc tatggtgagt caagaaatct acggagtata 
121261 attatgtact agatgtttgt ccgcgacttc gcggatgata atattatatg aaaagtacat 
121321 gttatgttta taacgttata gtgttataat agttgaatag taattagaaa ttttaaattt 
121381 tagaatttaa tgttgacaat ttttttttag aatttaatag tttgataatc aaaaatttca 
121441 gtatatattt ttagaagagc tgtgaagttt tgagatagtt caacaacaaa aatgtgtcac 
121501—aaaaatattt ttcttaattg caagtattgt tacaattgat ttaagtaatt ggttttaaat- 
121561 ttccttttag t ttattt taa~ catgtgaaaa aacaaaaaaa aaacatttta gccataaaaa 
121621 tatattttgt gtagttccaa aaatatggtt agaggtctct: atagttctga catcttcttt 
121681 gatgattttt tatgtttctc tttgtttgta tgattgtttt caatttattt taaattcagc 
121741 aaaaaataat tgttttatgt gtttctttca aaaaatctgg caaaataaaa gttcggatgt 
121801 aactaaaaac tcagatttat tttataaatg agttaggtag tctttccgtt tcaaaatgtt 
121861 acatgttttg tattttctta ataattgtgt aaaaacctta aacatcactt aaaatgatac 
121921 agagggagta ttaaatttaa gtattaaatg cataactttc agtgattatc gatttttcat 
121981 gattctaacc aataagaaat cagtaaatgt aattaatttt ttttgaaatt ctcaattaat 
122041 cattattaat tgataacaat tgtattgaaa atataaacca atgtatcttt ctgaaacaaa 
122101 atttttttct tagaatatgt atctttatga aatggaggaa atattaaaca gtatgtaaat 
122161 caaaagtcta tagataattg attgtaaaac taatttttat tttaacactc taaaattatt 
122221 aaaaactaaa atatcaacta ttgaaattgt aagatttcac caatgaatgt atatgtaaca 
122281 aaaaatcatt tgggttattt atattttaat atataaataa acagtgtgtt ataattttat 
122341 atgatttaaa tgacttattt atttttaagc ataaaacgat aagtcactta ttatatttta 
122401 atagataaga tataattgtg cgataattta tatgattttt aaatcattat aaatgtacat 
122461 ttatacagtt ataattagat ctagaaacat ttaccatttt gcatctaaac aagatatgct 
122521 caagatttat atgaataatt ttgataaaag tcttaaatga ttcagaaact ctaataaaat 
122581 gattttatag gccaaagtta aaaaaatgtt ttaaagtaat ttcaaaatat ttgtaaatga 
122641 tttataaatc ttgtgtgggt tttaaaaaac aaaaacaatt ataataatgc tctatataca 
122701 aatctaagtt tttatataaa aatatataaa aatattttaa ataattataa caataaatta 
122761 taaagaaaaa aaattgacga aatattatat attctttata taattatgca attagtgacc 
122821 ataaactatt atttgtaata ttacttgtgt tttttagtaa cttgcattaa ttgattctag 
122881 aattttcttt tatttttaga tttgaataaa ataaatattt aataaaagtt atcaaccaat 
122941 caaactataa caatttcttg aaacttcaca tatgattgac acgtcagtaa agtgacttct 
123001 caattaatat atatagggat aagtgtgtct tgtatcgata tcatttaggg atattaaaat . 
123061 gagtttcctt aatgggtttt aaatggggtt ttaatataaa taaatgcaca tttacaccca 
123121 tttagtttat gtgatttaaa tgggtagttt tgttaagaca catttattaa atggtgtttc 
123181 acttaaaatt agtggtgatc ctattaaccc atttaactta atacatactc cctctgtttt 
123241 ttaaagatgt atgttttggt gttttcacac atattaagaa aacacattaa ttatacatca 
123301 tttttagaaa ttatcaaatt ccaatgcatt ttaaccaata gtctttcaat aaattcaatT 
123361 aattttfattg aaatttgcaa ttttgtatag gaaacataaa aaaatacatc tttgtgaaac 
123421 aatttatttt tctaaaacat ctatctttaa aaaacagagg gagtatttat taattcgatt 
123481 ttgtgttttt ggcggaaaac ttgattttgt gttttgacta gaaacacaca attttacggt 
123541 tatgacatga agtaatgatt ttgcggcaga aaaacgtgat tttgtggttt cggcgggaat 
123601 tttgattttt cggtttttgt gagaaaatta gattttgcgg ttttttgtga gaaaagcttg 
123661 atttcaattt tgacggaaaa cataattttg tggtgggaaa ataaattttt cttattttgt 
123721 cggaaaatgt aattttacgg atttcgcagg aaaatgagag tttactgttt tagtgagaaa 
123781 catgattatg tggttttagc gagaaaaatc aattttaacc atgttctttt tgtaaaatgg 
123841 cggcagcgtc ccgcgtctta aaaatacact tgatcttgaa tgaatcattc ctttcgctgt 
123901 ttattgatta agacgctaac agtacggcca attaaaatta acggggattt gtgacgctgt 
123961 aaaatccagc gtcagaataa atggagatgc ggtggttttt tatgaaacgc agaagcatca 
124021 atcaaacgag gatgccttaa aaagctgttt cctttgtatt ttttttaacg ccgaacgctg 
124081 ccgctgcgtc taaaaaagaa cagagatttg tgtttttagc aagaaaatcc aattttgtgg 
124141 ttttcgtaca aaattttgat tttgaggttt tggctggaaa atctgattgt attgtttttt 
124201 tgtgggaagt ttaattttgc agatttggcg gagaaaagaa attttgtgtt ttggtagaaa 
124261 aacatgatta tgtgcttttg gtgagaaaat ctgattttgt gttttggcag gaaaataatt 
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124321 ttgtggtttt tgaagaaaaa tctaattttg cctaattttg atttaatcgc ttaatttccg 
124381 cgttgatcgc gaaaagttga aacacaatta attattttct actcattatt gacagatttt 
124441 gtgttaatat tcattaccaa gtaattacag ttagctatta aattcaaaac cattgttcaa 
124501 acccaaaaca tatacatact -acatttgagg atttctttgt atcaaacaaa cactgatcta 
124561 aatgttttga caaatcaaaa ggatattgtt ttaaattatc tataaaagtt tatataatta 
124621 tgtctaaaaa cttgtgtcat catttttaga attagctaaa catattataa atatttaaat 
124681 tgtatttaaa attagacttc acgatatttt ggtaactcgc taggcacggt tttaccactg 
124741 gacaagcgcc taacgcgatt ccgaacaagg atatatcata ttaacatccc tgataccacc 
124801 ttaatcataa ggattttggg atttaattta agttgacaca aagtataagg gctaagagtt 
124861, tataaaggtt tttgggttgt gtacctcact accttgaaca tgtttagttt aagtgtaaca 
124921 tgtaaaaatg ggaattgaat tctatgaaat ctaatctctc gctgttgctc agtagttcag 
124981 tggtaataga gtgattgagt gttaactaat aattggtcgt agttcaactt aaactgagta 
125041 atattttttc tttgttaaga atactttgat tatttttggt tgaattaatt tgtgacataa 
125101 acaaaacata atatctgtat gtgttatttt atgtatcata ctcatcttac acttttaatt 
- — 125161 ggtattaatt t tacaagttt ""taatgtacga tttacgtacg tatcttacgt aaatggacat 
125221 gtgcattttt ctfatftgttca tgttcatgcg agaactatca tccatagatc gtgcaaaaga 
125281 gtggtgggga aaaataactt ctttataaaa ttataacttt taattaatgc gataaaaagg 
125341 caagagaatg tagtttcgaa ttaattacac cacatcatat ccatcaaaat catgaaattt 
125401 agaatttttt agcaccatgc acgtggacag taattttaac ggaaatcaat tagtcaaaca 
125461 atcaaggcac ccagctgttc cttatgaaga aaaacaaaag cttttgatga aactaaaatg 
125521 tcattttaca gaacatatac tatatagtaa tagttgtagc tactactgat tgttgtcatg 
125581 actgtttcag agtgtagaac accatctgat ataaacatgc atgcatgtat atatgttcat 
125641 tcaaatggtg aaatgtattt gcgtacgcat aagcaaaaag acaatgaaac aagtcaaaca 
125701 acacaagact atgggtctaa ctggtgacca cctcgggaat agtaggaaca aaaaggaaaa 
125761 gaattgatgg gaataaaatt atgggaatga tgaggaacgg ttattccata tcaaatttgg 
125821 tgaggaataa atttgttctt taattctcta caataaaagg aatgagaagg aatgaaaagg 
125881 aatgactatt cctaatgaat ggtaaaafctt gttaggaaca ttaaggaatg cattattcct 
125941 cttcattcct tggtcaccgg ttagacccta tatgtggatt atgcatatca tatacacgtg 
126001 aaaggtacct ttaactatca aattgatttt gtagactcct tcaaaccaat ctaggtcact 
126061 tcgaactgta gattatccct tacttattaa tacagaagca tttttaggat ataaccttta 
126121 ttttgtattt tattcacaag tttgccatta taggtgatct gtcattacca tagctcttct 
126181 cagctttcgt ttttaataac cctttgctta ctagagaaat tactctatat gccattagta 
126241 cacatatgac ataattggct tgataacata ccaatttaat aagatgggtt tttgttcttt 
126301 tgtgagattt gctcgccgat gatgtcgggt atctcgccgg atatctgagt ttgacgacaa 
126361 gtttagagct ctaaggttgg agagtaaagt aatcaaagtg ggtaatcatc ctttatcaga 
126421 ttatctgata aatcgaaaat gtgagatttg agtggaatgt tttagggatg tttacggaga 
126481 gattcgtgtc ggagatggtg agagagacga ttttttttgt gactttttcg aggatgtcaa 
126541 agagacaaga ggtggtggag acaggggtgg aagagacagt gagatcagtg aggttaacga 
126601 gacgagcggg tagagtcatg agaggccgtt gcggcagcgg aggaatgaag agacggcgga 
126661 gaaggagtgg agtgcgtcag ggaggcgatg tgcgagtagg tgggagcttc gaaggttact 
126721 acggcagatt cgttcggtcc tgtcgttttg ggtggtacct ttgatcaatt gctcaatgga 
126781 catcagactt caaggttaat tgttgttttc agtctgtggt ttctgcttaa gaatcgaact 
126841 aatttgtcga aatatttgtt tgtggtgatg gagaaatcag gcggtggaga tagccatgga 
126901 tcgaatagta gcggggtatg tgatatcccg atgttgatga acaaacaaat aagccatctc 
126961_tccctcttga atctgctctc atctctccct tttcgcctag attagtttag catcctacac 
127021 catttgatat agcaagtcat tcgagaacga ggaatctagc aacttatgtg aggttgaagt 
127081 ctcttttatg gctaaaattg attgtactga atgtttctgt ggtttaaaac tgaaactgtt 
127141 tagctttgtt gatgagctgg aaagttcaag gtctcaaaga cattaaagag ttcaattccc 
127201 tgcttatgtg agattggagt ttaagtccag ttgccttatt agttcttatc atgagttgca 
127261 aagacgtgtt attgtgatct caaggcaaaa ggaagactct ttttttatcg tcttatatag 
127321 tgagtttgta aagctgaaac ttttatgcat tggtttggct tgaattcttg ctgtctgaac 
127381 tcactaatgt cttggcttga attctcagcg agcttgctgt ctgaactcac taatgtcttg 
127441 gcttgaattt tcagcgagct ttcgtctacc ttacttgctc tgtttttttc ttctttcatg 
127501 gcagctctgt ttatttttct tctccttcaa cacgttacca tctaaacatg gtttttaaat 
127561 aaatattttg cattttgtta tcctttcata tatcaaaatt atcaaatcga gttgtcatta 
127621 ctaagatggt ttcttccagc tatacataca catgctatag aacacgactt ctttctcttc 
127681 aagacatgtg ttgattatac tatacagtgt ggttctatat aattcttttt aattaagata 
127741 tcagtacttt acagtttgat tatctttcta gcaagatgtt acgagggtga gatattagca 
127801 attctgaaaa tatgcatttt cactgttgtt atactcatat gtataatcgc tttatcttgg 
127861 tcacagaata aaaatactta aatatgaact catacatttc attgtcaatc gtgtaatgtt 
127921 gcatataaaa catacgaggt tatattctaa tttcaaaaaa gtacaaaagc aaagtcgtga 
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127981 gtgatcagat cattgtgaat caggagtaaa tcactatcaa aggatcaata gttatataaa 
128041 ccagcagcga ttcacacata cataccattt tcatctcatt gatgctaggc ctaatacatg 
128101 ttactttaaa aaaaaatgga aaatgaggtt tggtatataa tatggccttt tggtaacata 
128161 ccgtcaatac atgcattcgt tagttatgct taaattttac ggaaatatta aattgtgttc 
128221 aaagtgaatt tatgtgaggt gaatataatt aagttgccat tatgtttagg tatattctaa 
128281 aacaaaatat tggtattcat gtgcccagaa aatctcgctc acacaacaaa gagatacact 
128341 tatggtccaa atactataac gtatcaatct aaattaaaaa aaatctcaaa gttgaaaaat 
128401 attctgtagg aatttgataa tgtattccaa tttcttacac attaggtatc catatcgatg 
128461 tactaatata tagaggtgtt cagtctgata gaactaacca attaaaacca aacaaaatcg 
128521 aagtagaaaa aaaagattta gatttgatag ttcaaaagaa gtcgagccga tgttatcttt 
128581 aaaaaataac accgaaaaat tgattaacta aaaaattagt atatttatat gtaatttttg 
128641 taataaaatt aacattaaat aatatataca aatttatttt attaatatgg tattcataat 
128701 caaaacattg attaaatatt tttttatttt tttaagttaa aatggttatt ttatttttca 
128761 ttttcatcaa attaaataaa acatattttt gtttttattg ataacttctt gtgataaaat 
128821 ataattatgt aataatcctt tggtattatt actttttact tttatttgaa tatttaaata 
128881 aaatatatat taatgtttta ataaatctga tttatattca tataatctaa tttttttaaa 
128941 aataaatata tttatggttt cttggtataa aaccgaatga accaaaaatt aatggtatat 
129001 aaacgaacca aaccaaatta tatataattt caatatgata gcgaattttt ataaattgaa 
129061 aaatcaaaaa aaaaaaaaac tgagatagta tttaattctg gaggcctgat taatattata 
129121 aatatttgtc attaattctc tgtgtggtta gtacatacta tatgagcaag ttatgcaatg 
129181 cacctttttc tgtctcattt agtttactgt agtacatttt attgtacata atattgtaga 
129241 tgttatcaat aatgtattct gcaaactata aatgaatatg taatgttttt ttagtaacac 
129301 caatgaatat gtaatgtaac ttatgctatt ttacaatagt ttcattccgc gcatagcgca 
129361 ggtctctacc tagttctatt agtactaagt aaaccgaata caggtgtgga cgtcacatta 
129421 ttacaagcta agcttaattg gttaaagtaa ttaagttcgg ttggttttag gatttatgat 
129481 agaatcatag agagtggctg gtgctgacaa tgttattaat gcttaattaa taagctggac 
129541 caccaaaaat ggactgttcg ataatcaact gaagactctg atccactcaa taaatgcgag 
129601 catctctttc attataaaat aggttttatt tatttttttg gaacaaaaat aggttttatt 
129661 tatcttcaca tgttcttaca tccaatgttg gaatatatca ttttatcttc acttatgtct 
129721 gcgccatgtt atttgtaacg caaaaaaatt gttctcttca ctggttatag ttaacataac 
129781 atgtaacaaa taaaaaacat gtaaaataat tacatattta gctgcctagg aattgggggt 
129841 atagattaaa tattatttat aattaaatcc tttataacaa aagtcaaagt gatccttttt 
129901 ggagtgaaaa ttagaatctc tctgatataa aatactttgt ggatatctca actaaaaaag 
129961 ttagaaagga actgagtatc tcaaaatgag ggatttctca attagatatt caccttagat 
130021 acttttgcaa taggattact ctcttagcta gaatcatgtt tacttgtcct cgcgaggacc 
130081 acaatgtaca aataacacgg atgacatgtt catacaaaat tatgaataat ctgtattagg 
130141 aaattactga aaaggttatt ccttcaattt cacattaatt gtcattctat aatttaaatt 
130201 ttgtttaatt ttagtacttt ttatgtttta atgtatttat ttgttagttt ttctaatttt 
130261 gtctatgtta tttttgttta tcaattaaga taagtcaaat aaatactctt acatttgtga 
130321 ataaatgtca ctttgatatt tttaacacag attaaaaaaa tgacaaaaat aaatataaat 
130381 tattattaat tacacatttc tgatcaataa tatttgagat aaataaaatt atttataaaa 
130441 tcaatgcagt ttacaattaa ttttcaattg aaagtaaata taatttgtat tgaaattgta 
130501 aaatgatatt tttgtgtaac aaaaaaatta gaatgacaat tattatgaaa taaatggaat 
130561 caatgactca ttattatttt ctgtacatct aaactagtgc tatcatgcgc tttaatgtga 
130621 cggaaaagta tggcaattag tacgagaacc tctaagagca ttttcaacag accctcaaat 
130681 cctcaaattt taaggttttt gtttcttcaa cagatcctta aatcttcaaa ttttgaaggc 
130741 ctgaatagtg aaacctcaaa tttgaggttc cactattcag gccctcaaaa cactatttac 
130801 ttttcagttt agttcttata aatttgtaac ttacatatat ttgtactaat attaattttc 
130861 ttattatttt aatccttgta actatatttc. acttaaatct ttaaatatat cttatatata 
130921 cagatatata tacatatata tacatatata ttattacata aaatatgaaa atatgttaaa 
130981 actaaaataa aataaataaa caaaaaatat tacttaaaag atatttatta tttcacaaaa 
131041 catttatatt aaattgaaca atatattaca aattaatttt gtgttataat aaattgtatt 
131101 tataaatatt atgttttata atttcagtgt gtatgtacta tctatctatg aaaaatataa 
131161 taatgatagt tgatttatat taaagttagg gactgaaatg caaattacat aaaattttaa 
131221 aaataaattt gaggatcctt gttggagtaa cccaccttca aaccctcatt ttgaggatca 
131281 ttggccctta aaatgaagtt ccttgttgga gatgctctta gagcaacttt atcggcagca 
131341 aaatcacgta tcttaaaggg ggggaccacg tactttttgc aaaacccatg acagacgttg 
131401 gttaaataag agacgtttgt gctgcgtttg ttaacctgtt tgcggactct actgacacgt 
131461 agtgatccgc gactggctca attattatta ttttttttta aatcagaaaa atgcaaaaaa 
131521 aaaattaaaa ataaataaat taaaaaccta acaaacgtat caccgataat gatgccctaa 
131581 ctgactaggg gtagtatatg gttcagtatg ttaattcaaa cacattaaag atttagaatc 
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131641 aaatgataaa taatactatt tggccgtgta 
131701 aaaaccttac tcctttttca aggttcagac 
131761 ttattttgag agatgcgaag attttattaa 
131821 ctcttggttt cttgcacagt gttttaaata 
131881 tcaaacatat acaatcacag ttgattctct 
131941 tttgatccaa gcttcaatct ctctcaatat 
132001 gagtctcaaa taactttagg aaagtttgag 
132061 actatatgaa atgcaagctt ttttgtctgt 
132121 aatttcgtgt tcttcgtcgt gtgtagccat 
132181 taagctacca gcattaattt atattttctt 
132241 gttgtccttt tggcacatcg ttaaaattgg 
132301 gcgcaaagat gatacgcaca aataaaaaaa 
132361 aattaaccgt tgttgttggc caaaataagt 
132421 atattattaa atcagaagta cacatataga 
132481 ttctatgcca ctaacgttaa ataatacttc 
132541 taatgcattt tctaaaatta aatttgaatt 
132601 taatgctttg tattttccac ttacattaat 
132661 atatttcatt aacttttcaa ttaaatcaat 
132721 caaacaattc atggaaatta taatatcaac 
132781 tattatcaaa tttgaaccga aactagatac 
132841 agagaaccat aaccaaacct tatctgaacg 
132901 tcatatttat atacttcaat atgttagcta 
132961 ttttaagttg tttaaaatat ttgaaatata 
13 3 021 gataaacaat aatcaaaaca ccaaaatact 
133081 tattcaagtt aaacctattt taaattttta 
133141 tttacatgtt atatttttta agatttaggg 
133201 taaaaataat ttaaacgggt tatcaaaccc 
133261 tttataaata cctgaataga gctagaatct 
133321 tttaaccgaa ttcgagtgga tatgcaaata 
133381 tcaaatatta caaatatact atttagtata 
133441 gcggtcgcac gggtcaagat ctagtttatg 
133501 tgccctttat atattatttt aaatctgatt 
133561 cttaagaatt atggttttat actactttaa 
133621 ataatatact atatttaaac cattcacaga 
133681 caactaatat attgtggtca gatctaactc 
133741 gttttgattt catgttaagt ttcatccata 
133801 tatagatagt gtaagtttcc ttagtttact 
133861 gtatagtaca aagagaaaga acaacttagt 
133921 gagaggtgtt tggaacacaa tattcaacct 
133981 aaagattata ttgaacatga accaagttaa 
134041 ttcgaaattt gaagtcattc aaaaaatgaa 
134101 ttcttcaaaa aaatatattc tatatattct 
134161 acactttata cttttcaaat ttactagttt 
134221 gactctaata gtaagttaaa cattattata 
134281 gtaaacatac acttttagat tttataaaga 
134341 atcagaatat tataaaagcc aaaattgagt 
134401 catagaattc ttaaaattta ataatatcat 
134461 aagcaataat ataaattttg aaattaaatg 
134521 aattaaaatt atagatcagt ttgtaaacaa 
134581 cattattttt tcaattgaag aaatactgtt 
134641 aaaataaagt accactacag taaaaaataa 
134701 gatgatctta cacaacatat gatataaagt 
134761 gccaccgaaa aactacattc cgactatttt 
134821 cttttaaaaa ttcctcgtat atttatgtat 
134 881 aagaatgtat ctaattcacc tctctattta 
134941 tattaaacac attcttattt cttttacgaa 
135001 gagaaagtaa catttctcta tatttagctt 
135061 tatcttttta taaaaaaaaa tctatgttat 
135121 agatggtcta agtgtcataa agaaattctg 
135181 accagacttg catttatctt aaaagagttt 
135241 tacaaggagc caaccaaagt gataaactga 



atcatacaag aaataaataa aacagtgaaa 
acgacaagaa gtagtcctag ttaggggtgc 
aaacatacta tagcttacgg tattaaatag 
tctcatttaa atacatattt ttgtcaaaac 
tcctgactct aattaatttt caccaacatc 
gcaaccttat tgtttgactc gcatggaaca 
caagtgttaa cattcgaaat caatgcagaa 
ttcccgggag atccttgatc tataacctta 
tagaatcagt tatgtgtttt tttttctttt 
gtgtatatac tccgctccta acaattaatt 
aacgaaacag aaaaaattag cataaccttt 
tagtccaaat ttttagttag caaaaaaaac 
cttggttgga tttatatgaa tgttatatct 
atgcccttta gtttttagta ttatttacac 
ctattttaat gctgtctttt ccactttgat 
aaatacacaa ttaaataata cttcctattt 
gtattttcta aaattaaatt tgaattaaat 
ttcaaattaa aaaaatacat ttttgttgga 
tcttcattta acaaatctga acgaaaagaa 
tatccaaacg gatttaccat tttggtatct 
aaatatttca gagattcgaa tgtatttaaa 
ttttcgagtt aatatccaat atataagtta 
aaaaatagtc gaaagtaaac atctaaagta 
taaaatatat atatttatta ttcatccaaa 
atttaggtat tttgacttac attactcaaa 
atatttaaag atatataaat ttaaaaaatt 
gcaaagatcc gaataaaacc ggaaccaaag 
ttaaactcaa aatttcaaac ccgaatagat 
tgcatcccta gcttagtcaa tataaaacga 
aataaataaa aactgaaaat taatacacgt 
aattatgata ctaactaata gattggtcaa 
atatattttg aaataggtta ttaatacgtt 
cttgaactga attccattct acaacacata 
tcgtggtcaa taaaaccata tataattgat 
gtataccctg ttaaaaaaat gatcacatat 
aaccaatagt gggtaagcat gtgtgttgat 
taatttcgaa tgtataactt ccactcaaaa 
ttttcttttg acaaataaaa tacaaataga 
aaaggcaagt acccacttct tttgtttctt 
gatatctcca atgataactt catttttttc 
gcaacaagtb tttcaataat ttgcttcatt 
ctatcactat tttatattta attattagta 
tactcattta ttttgtttta acaataatcc 
tgatataatt attacacatg acataaataa 
aacaccaaat acaaatatct taaatgatta 
ttattttatt ctaatttctt ctaaatcaaa 
gtctttcgtt attaacatct aattagatga 
ggtggcattt aaacatacaa aataacaata 
aaaaatatat tctaaaaatg aacaattgaa 
caattctcaa aattttaaga aataaagaaa 
agcaaaaata taaaatgaag tactattgaa 
tggcgagtta gctaattccc aaagattaaa 
ttttgttacc actcctttaa aataaaacta 
tttatgataa tataaagtgt ttgtttacct 
catgtataat agcatttaga gataaaatat 
attataaatt gttatttttc tttatttatg 
ttattttgaa attgataatt tagagaaata 
aaaaagaaag taaactatat tggagaatgg 
agaccatcct ttcttttttt ttttgatcaa 
aaactccatt agaggaggat acaacaaggc 
taacacaaaa gataaaattg atacaaatca 
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135301 caaaaagaaa ccataaaaat cgcatgacaa 
135361 caagtgtaac ccaaacggtc acaaacactg 
135421 aggaagatcc tttgagtacc agcaacgctt 
135481 atattgggaa gagaaacaac tccaaacagg 
135541 tcagattcaa caggcaagat atgtcaacga 
135601 aacacaagat aacagcacca aaaaccaggc 
135661 taagaaatcg cactgaaacg acatagatgc 
135721 aagctctgtc catcagaaca ttaccaatta 
135781 atacaactcg cattccaacc cattcaactg 
135841 gatgaacctg gaaggctgta accggagggt 
135901 cagaccattc cacgagccga caaggaaacg 
135961 agatctttgt cgccattgtt gaacaatgaa 
136021 gaagagaagg cttagtggag gtgacgcgga 
136081 ggtctggagg taacggcgga ggagcgacgg 
136141 acggcagcgt atctgcagga ggtctgtaaa 
13 6201 tggaggagga gcagagctgc ttcaaaggga 
136261 gagatccatc caaaacggca tacctcggga 
136321 ctgagaaaag ttagcagttc tttgagacca 
136381 caatagtaac ttcaaaatac aaaatactat 
136441 aataaaaaac actatttaac tatttttatt 
136501 aatgttaatg ggctttgttg tcttctccct 
136561 tcaccaactc ataagtcaaa ccctataatt 
136621 atggctaccg tagaaaacag aatgatgccg 
136681 gttcgaaaca ttcaatggag ttacggaatc 
136741 tacatctcct aatctttacc ctttcctctg 
136801 atattgagac attgttgttt ctaatcaaga 
136861 ctttcattct ttctcatttc aaactttatc 
136921 tgttggagtg gggagatgga tactacaatg 
136981 catcggaagt caaagctgac cagttgggtc 
137041 acgaatctct ctccctagcg gagtcttcaa 
137101 cttccgcctc ctctttgtct ccggaagatc 
137161 gcatgtcttt cgtcttcaac attggtgaag 
137221 tcaatcaaac cattttttta aaaaaaacga 
137281 attctaattt ttgattaaaa aaattatgta 
137341 atgtatacta ttacaaaatt tcaaagatca 
137401 tagtcaaact tacttataag agtactttga 
137461 tattatcatg tctccttcag ttctcatctt 
137521 ttgattgtta aaaacgcaat gtttttacgt 
137581 tgagttataa atgatgatat atagacgcat 
137641 atactattaa agtacaagca catttggatt 
137701 ttcttttgta ttcattaatg atattttgga 
137761 tttggtttgt tattaaccac cggtttccta 
137821 ttgatactgc atcattttaa tatttttcca 
137881 tctcaaacta tttaatggtt tggttttaaa 
137941 ctaacaaaaa ttatttataa aaaataaaaa 
138001 ggtaaccgga tttcgatttt aatagttttt 
138061 cccgaactga atttatcttt gatcaaccgg 
138121 tttcaaaaca ccgatcaaaa ctataaaact 
138181 ttcaaaatta aaaacctaaa aatacatgta 
138241 tataaaatac atatatgaga cgaattatat 
138301 gtgattatat aaaattttca cctaacataa 
138361 tagtaaaatg attaaaaaac caaactaatt 
138421 ggaggagtgt tggcgaacgg gcaaccaata 
138481 aaagtcttca ctcgctctct tcttgctaaa 
138541 tggaaatgct aacgctataa ctaaaacgtt 
138601 agtgcttcgc ttttgacagt ggtttgcttc 
138661 acgaccgaac atgtaagcca atatttttat 
138721 cataggagtt caaaccaatc aacatgcaga 
138781 tttttgttca aatatcctgt taattttttc 
138841 gtatttctag caagtatatt aaacaatata 
138901 acataaaaaa gtcccatctg acaaaaggtt 



cgatttcata agcttcttta ataatcgtac 
catcaacagc agaggggagc attagctggg 
tatcttccat ttcactccat acaagaggaa 
agacaattgg gagaccaata cccgtcaaga 
gttttccttc cagctgcaac aaagaagaga 
ttgacacaat aaaggaactg catactgcag 
cactaaaagg cacaaaggta gaaccgaagg 
aaaccaatga cctcaagttc aaagtcgccc 
aataaaggta agcggagaga gaggagctat 
gggaaggaag gtcgggacaa gatctgagaa 
acctgcacac aagagggacg ccaccaccac 
gctttaaatc aagggtgtgg gtaagagctt 
tgcgaagaag agaaaggaga gcgtctggag 
gagagagaac ttcaaccgga ggatactcgt 
ccaacggagc agcattcacc atcagggcca 
gtctgaagga gcgacggagc tgaagcggtg 
cgcgggcctg agagaaataa ttagggcgaa 
tcctttctct ctcgatcaat tcaaccacaa 
ttaacttttc tatttttaaa tttttttatc 
attctacccg tatatggaag gttcaagata 
tcccgaaact gaaccagtgt catgtaaaaa 
ccgttctgct tttttccttg tctacaattg 
gaaaatctaa agaagcacct cgctgtttca 
ttttggtctg tctctgcttc tcaaccaggg 
aagtttacag ttttcttggc tttctccacg 
actgacacaa aatatctggg cacatttcaa 
tgtatgttgt gggttaaaca taaaaaagac 
gagacattaa gactaggaag acggttcaag 
ttgagagaag tgagcagctt agagagcttt 
cctcctgtgg ttctcaggtc aatagacggg 
tcaccgacac tgagtggtat tacttagtat 
ggtaaatctg tctttctcct aatcccatca 
cgtgacccta acttttatta tgtactcttt 
ctatatttct ccttagaaaa ctgatagaaa 
atgtattttt gtatagatat gaattaagat 
ttttgtctaa caaaaaagac aaacatcctt 
ccacgatagc ttattccgac atggacgagt 
ttgcgtgttt acaaagtatc aaccttaata 
ttttgtcgtc cggttaagca taaaacatct 
tttactcagt ttacccctat taaagaagct 
cattctgtat tggtttcttt tttaattgtt 
attcaatttt ggtatgtgat tattccgtgt 
accggacatg tttgaaactg catcgttttt 
ctgctttttc ctaaatataa tcccaactat 
ttgtttattg gttcaaccag tggttcaacc 
attggatttt ttaatttttt ttttttcaaa 
taatccgttc aaccgcaggt ctaagtcgaa 
gttatattgg tttatattta aaatatctaa 
taatatagtt ttatcaaaca taaatctgga 
aaacacatat acaaaacgaa ttatataaat 
acccgcgctt tgaaagcgcg gatcaaaatc 
tttccgtctt tttattattg tagagttccc 
tggttatgta acgctcatac cgccgatagt 
gtaagttttt gggtagtttc tatttgcaac 
ttttcttgtg gccgcttttg ctcttttaag 
ccatttcttg gaggagttct tgagatcggc 
atttatccta taaactcagc cgtttcgacg 
tacatattat cctgttaatt ttttcgtttt 
atctatcaaa aattagcaca aaaggtaaaa 
atttacattg atttgataaa tagaaatagt 
taaattctcc gacatttcta atcaatattg 
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138961 tcataaagct gttatatttc cagcgcttaa tagtgaataa tgaaagttaa aagggctaat 
139021 ttattatttt ttgttaaatt attttaggtt gcagagaact taaacgtgat acaatgcgtg 
139081 aagacattgt tccttgaagc tcctcatgga actttatcaa cgagatccga ttatcaagaa 
139141 attttcgatc ctttaagcca cgataaatac attccagtgt ttggaactga agcttttccg 
139201 acaacttcca caagcgtgta tgagcaagaa ccagatgatc atgattcgtt catcaacggt 
139261 ggtggtgcat cccaggtaca aagctggcag tttgtgggtg aagaactcag taattgcgtt 
139321 caccaaccgc ttaattctag cgattgcgtt tcccagactt ttgttggaac aaccgggaga 
139381 gttacttgcg gtccaaggaa gagtaggaat caacggttag atcagattca agaacagaat 
139441 aaccgagtaa atatggacga cgatgttcat taccaaggcg tgatctcgac gattttcaaa 
139501 acaacgcatc agctagttct tggaccgcag tttcagaact ttgataagcg gtctagtttc 
139561 acacggtgga ggaggtcatc attgtctgca aaaacgttgg gagagaagtc gcaaaatatg 
139621 ttaaagaaga ttatatccga ggttcctcgg atgcaccaaa agaaggcgtt gttaccagac 
139681 acaccagaag atagcgggtt taaggttggg gatgaaaccg cgaaccacgc cttgtccgag 
139741 aggaaacgcc gcgagaaatt gaatgatcgg ttcataacgt tgagatcaat gattccttca 
139801 attagtaagg taacctaatc caacactttg _cttcaatagc gttttgaatt tacgacaagc 
139861 gttttgaatt tactacacgc-gt tttgcatg tgcatgctaa aatattacat tatattttca 
139921 aggaaagtaa tcttatatct tgtggattct tcagaccgat aaggtgtcga ttcttgatga 
139981 tacgatcgag tatcttcaag aacttcaaag acgggttcaa gaattggaat cttgcagaga 
140041 atctgacggt aaagaaatgc gaatggctat gaaaaggaag aaaatggagg atgaagatga 
140101 aagagtatcg gcgaattgtt tgaaaagcaa gaggaaggag agtgagagtg atgtgaatgt 
140161 tgaagaagat gaaccggctg ataccggtta tgctggtcta acagataatt taaggatcgg 
140221 ttcgtttggc aatgaggtgg ttattgagct taggtgtgct tggagagaag ggatattgct 
140281 tgagataatg gatgtcatta gtgatctcaa tttggactct cactcggtac agtcttcaac 
140341 cggggatggg ttattgtgtt taactgtcaa ttgcaaggta cagctactaa cacaagctaa 
140401 atttctgtta cctaaccgga atttgagtaa accgaaactg atatatattt taatgctgat 
140461 tgcagcataa agggacaaaa atagccacaa caggagtgat tcaaaatgca cttcaaagag 
140521 ttgcatggat atgttaaagg tctcacaatg ttttagattg acagaattag cttttattcc 
140581 ggtttaattt tctaatcttt ggtactcaga tatttaaacc ggattttttt ttgagatttt 
140641 gaaaatcttg gcagatcctt tgaaattctg gtaaaatata atttcttttg gttcataacg 
140701 tagtcgtaga ctaatatgag ttccttagtt gatatataat gaaggtaaat cagatgatgc 
140761 gaaatgaaac actcggctta gttagtacag tatctattct tgatgatagt acggtttgca 
140821 aaagaaaata gaagtaaaat tagtagcaaa tgcctatgtt gtgactagtt aaagattcta 
140881 ccaaacaaat catatactaa aatctaaata gatagaagat agttcgattt caatattctt 
140941 ttggtatatt taggaaagta atgttttaaa gtatagtttg aaaatataaa gataagattg 
141001 gttggtcaaa agctgctagg aaagtgtttt gttcacaaaa tacaatgcct tagtaaaaag 
141061 aagtaatgtc atttcaaata attgcagcta acaccactga gaaatttgaa catgcgttac 
141121 atattttaac aattacttta tcataaaaag aaggaagaaa atcttagcac aaggagttgg 
141181 actgtatttg gatttcgtat ggtagcatcc aaaataaagt ctatcttttg agtaagaaga 
141241 tgttaacttt actgcaaaga atacagaacg aggtgttaga aaattggttt gagacctttc 
141301 ttgatttaga acaagaataa taaaggagaa attctttaat gagattggtt gacatggtag 
141361 atggtctcta acatgatcca tcagttggtg aagttgacgc ttcgtagatg gttgtttagt 
141421 aaaacacaat attaatttgt tcgagaagat tatggaccta tgcatagaac cttttttttg 
141481 gttaaccaac aacatgtgtt gatccaacta gtgtctataa aggtggctat ataccaacta 
141541 gtttgtatac atcaattaat atgtataaat tcgactgtga catgcataac gatccagcta 
141601 tagcatgcat aagttaagac tatatagcat gtgtctctaa tgtacgatcc agctatttgt 
141661 gggacaccat aataactata attcaaggct gatttctaaa gctagaaaga ctatggaccc 
141721 gagtatctgg aacatcctag ctaactacta tatgtagtag ttctactaca acatatatga 
141781 ccacattgtg caaacctgca tatataatga aaaaaatctg actacatcat gcataaatcc 
141841 agttataata acatacaaaa tccaaccttc atttcagagc cgttgcaatt acagtaatca 
141901 gtcatttgtg ttacactttc gataacataa agagaaaact gatcttgagc caaagctggc 
141961 tgactcgagc caattgttag catatgaaac tagtattaga catggacatt tggatccttg 
142021 agtgggattt tggtaggttc tacttgaatc cggatctttt gcgtcagata tttggatcaa 
142081 tatatttaag tacttgaact ttttcggttt ggggttcggg taagtttcaa ccgggtcaaa 
142141 atcgttttgt gtcacaaaac taaatacatg tacagaacct tgagtgggat tttggtaggt 
142201 tctacttgaa tccggatctt ttgcgtcaga tatttggatc aatatattta agtacttgaa 
142261 cttttttggt ttggggttcg ggtaagtttc aaccgggtca aagtcgtttt gtgtcacaaa 
142321 actaaataca tgtacagagc cggtaaattt atgttatatg ttggattcga tcggttcaga 
142381 tatttaggac ccgaaacata ttcatagact tgaaaaatat taaaacctaa aagaatatcc 
142441 gcaaatattt aaatatctga aaatacagta attttatgtg aaacccgatc cgaacacatg 
142501 aaatttgagc aaaatatcca aaatatattt ctaaaatttc aaaattatac tcgaaaaccc 
142561 aaacccaaaa gttgaaacaa aaaaattgta atgccgaatg tatacccatg atacatgaaa 
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142621 tgtttgatgt atacaatttt ttggatattt cggatatccg attgaatctc gagtagtgtc 
142681 aagacccaaa atgagacctg cagatgaaaa aaatacccaa catgtatttt attatatacc 
142741 tgaatctgaa ccaaacatag gttggaatga gtttgattta ggctggattg tacagtatgt 
142801 gggtcaactg gtcgacacgg ttcatgcaaa ctgatagaga gcggaagcta tgtaacttca 
142861 agatcaaggg tttgatcgag aagccaataa cgtatttttg aagataactt caaaaacggg 
142921 ataagagggc gcgaatactc ttcgcagtgt tcgttaaaac tctttagggt tgcgaatact 
142981 cttcgcagtg cttgtgaaag ctcttaagaa aatcaatatc tttattaatt caagtctgtc 
143041 ttacaaaagc catagtagag ttctttatat agaactctta aaccgtcata gaaattttaa 
143101 acctaaatag aaaaggaaaa acataaacct taagtaaaaa ggaaataact agaaagaagg 
143161 aaacttgtcg tttaagttat gatgctgctg catcaggtcc tcccggttga gaaggatttg 
143221 cccacgaatc tggatcatca ttgtcgcctt tgaacggaga caaatgcttg atgttgaaga 
143281 cgtcagacgt acgaaggtgt gaaggcaacc tgacacgata cacattcgga ttgatgcgtt 
143341 ccaccacttc gactggacca agctttttag acttcaattt attgtaatca cgcaagggga 
143401 gtcgttcttt_ agttaagtac acccaaacga gatcacctgg ttcgaagatc agctcacaac 
143461 gcttggtgtc tgctgcaagt _tfcgfc-att tag tcgtagccga ttccaaatga gactgcgcga 
143521 gcttatggac gtcttgaagg ttggtaataa agtcaatggc ctctccatga aggcgagttt 
143581 tgtctggtgc agtcgaaaga tccaacggac cacgaggaac aataccatac acaacacaaa 
143641 aacggtgaga aacctgtact gcgattagtg gcatgattgt gtgcaaactc cgcctgtcca 
143701 agcttagtgt cccacgactt tatgttatct cccacaagac aacgaagtaa attacccaaa 
143761 gaacggttta caacctccgt ttgaccatca gtctgaggat ggtaagccaa gctcatgtca 
143821 agagaagtac ctactaattt ccataacgaa cgccgaaagt gactgagaaa ccgtgtgtct 
143 881 cgatcggaga caataaatgc gggagcgaga gcgagagcga gggcaggagc aagggaggga 

143 941 gcgggagcga gggagcggga gcgagggagc gagggaggga gggaacggga gcgagggagc 

144 001 gagagaggga gggaacggga gcgagggagc gagggcgaaa gagcgagcga gggagcgagg 
144 061 gcgagcgagg gagagatcga gggagagagc gagcgagaga gcgagatagc gagagcgaga 
144121 gagcgagaga gcgacagaga gagggcggga gagagaaaga gggagagaga gagagaggga 
144181 gagagggagc gggagcgggg gagcgggagc gggggagcgg gagcggggga gcgggagcgg 
144 241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nniinnnnrinn nnnnnnnnnn nnnnnnnnnn 
144301 gagcgaggga ccgggagcgg gagagaggga gggagcaagg gagggagcga gggagggagc 
1443 61 gagagcgagg gagggagcga gagcgagaga gcgagagaga gagagcgagg gaacgagaga 
144421 gagagcgaag gagcgaaaga gagagcgagg gagcgagaga gagagaggga gacagagaga 
144481 gggagagaga gagatcttat aacaacctga tcaagagtaa taatagctgg agagagaccg 
144541 ccatgaacac aaaatatctt gttttcaaca acagctgaaa gactattgaa cacagaaaac 
144601 taattatgaa gattatattc tcaaactgaa attattttat atacatagca aaaaaaatgt 
144661 tgaatagtcc atagagatac cagattaatc catacaccta atctgcattg ttcttagctc 
144721 atgagatatc tcaaccacca acccttgcat aatattttca acgacatcag agactacacc 
144781 ttcagcttca ccaccacaat ccaaccaaag acatgatggc tttatcagat cagcctcaac 
144841 gacctgatcc aatgcttttg gagagtcaca gcagagtaaa ctccagtcaa aacgtctcaa 
144901 cgtctcatgt atcagatctt cttctctaaa ggaagctggt acattgacga ctctcgagct 
14496i ctgagacggc ttgaaaggcc aaaacttgat gtcttgacgg taaatctcgc ggagaatttc 
145021 gtctgtgtag tcaagaacaa gaagaagctg gtcgttgttg tgattaccgt gtgagaattc 
145081 gtctagtagt gatgtttctt cgttgcatct cgccataaga ttggtccagt tcagtcttga 
145141 agcttggaga agttgcttca caaactcgtg aacagagtct ttctcgacaa gatcaaaaga 
145201 taacagtcca atctgctctt ctgcgcatga aaagtaaaga gatagtgaga cttaagtgat 
145261 taaaagatga aatcattgat cgatcagaga _taagagacta actgtttctg aggccaactg 
145321 tttcatctaa catatgaatc ctctctagta cagagacagg actcggttgc tcttgttcct 
145381 gtctttttat actcgactcc tgcaacgatt tagagaaaga gattaaaaca tcaaatccgt 
145441 tgcagagaga catctacggt ttcttaaaga acgtacagct gatgagctct ctagatatct 
145501 atcttcctct agtccgtctt ctacaggagc tctcaggctt gttttctcac tgagagacct 
145561 tacattgtag ctgtgcgrgat ttggatcgaa aacctccaag tcctccttgt ccaaggacca 
145621 caacttgaag ctctctcttc tcttctccag cctctctctt ctctctttga gttttgtctg 
145681 taagctactc agtgctttag gaacttcata aaccggtgaa gaatctgatc tctcctgttg 
145741 cctttccact gtgaacctcc ttggcgaaac atccttctca atgcttagct tgaagacatc 
145801 agactcaaca cgtttcttgc gcctaaacac tggtgactcc gggcttctat agccaactgc 
145861 actgctagaa acatcgccag aagcctcggt gtctttgtta gatccaacaa caagcttcaa 
145921 cttcttggtt agatgtttga atctcaagta ctttctgtcg tcaaccaagc ttggtagtct 
145981 aggcttcaag agaacgatct catccaaatc atcagccagt gaaggagatt tatcacgccg 
146041 ttccttcaca tgaggtctct ccaagtcttg accttttttg gctgagaacg agttaggatc 
146101 ttgtaagagt gtaatgaaca actctttgtt cgagcttaga atatctaatg cttccatgaa 
146161 ctgcttggac cggtcttttc ctttctcctc tgccttggaa ctgatcaatg tttcagcagc 
146221 ttcattcaca caaacttgaa ggttaatctc tgagctcttc tttttacaat ctctcccatt 
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146281 cttcagatcc atttgcttag ccttctcaac ttcactcaaa gaagtttcct cctttgcatt 
146341 cttgatcctc tttatctctc tctctagtaa accctgaaca ggttttgatt catagctctc 
146401 gtcttctctt tcagaactaa aggatcttct cctcgtttta ggcttacatg agttctctcc 
146461 ttccacaatc cgctgaacat atcaagagat tcagttttct gcttccaaaa aaagccaact 
146521 caaaaaaact aaacaatttt gtaaaaactt acatcatgac actgacatct ctcatgaagt 
146581 ctcttgagca aatcaagttg gcttttagta taaacattac ctgcaaacaa acaagaagca 
146641 aaattttaaa actaagatca tgaagtgatc caagaaacac acttaacagt actactaacc 
146701 agcagagtct ctgatgggtt tcttctgtga actcagtctc ttattaccag atttaacatg 
146761 tctgaaatcg aagaagttaa tcagtttcca accacaacca gacttgtaac tctcgtagac 
146821 aagagggctt ttagggaaaa ttggagttcc aggactcttg ctgcttcttg gtgatctcct 
146881 tttctccatt gattatcaaa taactcaaat cattcaaata tcatgtctac ttcaatctag 
146941 taaaaaaaaa aacgattccc aataacctga aacacacaca gaaaccctga tcttttagac 
147001 aaaatatgaa aaataccaac ataaacaaaa aaaaaacaga ggattcttga agttgtgtcc 
147061 gaaaattaca tcttcaggat acaaactgtc aattttactt tctgggtttc atttttattc 
147121 cctactctaa accctaaact gagatttcag tttcttcact aaaaaagcct tc.ctttagtg 
147181 taataatgta atatgtacat- gtatatatgc gcttgtacag acttatctat cagagatttg 
147241 cttaccacta agctttgagg ttgaaatggg acagaaaaaa atgcaaactt tttctttata 
147301 cagtcagtgg agccaatcaa gcactgagct ttgtcatgga ctgatgaact ttttttaaga 
147361 gttaaaaaaa ccgcaaattt tgtttccaca tacaaaagaa aaagggctaa aacatgattg 
147421 aggttgaaga tcttttgcga aaattaaagg gacccttgtc tcttcctcta cccttctaat 
147481 taaaaaaaag gatatttttc tgtattcaag aaaaattgta aggacaataa gtgaggatta 
147541 ctgtttattc ttggagatgg gtcctctcac cattaaagag gcttttagtc tcgtcctgga 
147601 gaaggaggga gaggaggagg aagatgaagt aagactcttt ataaaataag atatatgatt 
147661 ggaagagaaa tgtaaataat caaaagacca attgaaaata gatctttatt agaccatctg 
147721 acatttttat tttgttttct ttttaacagg caagcagaca ttttcttgac agtgggattt 
147781 ttatttattt ctagtaaaaa tttacaacta cgtttttaat tagcaaaata aataagcgaa 
147841 aatgaagggc aggagaggga cgtgagaatg aaagagactg taattttgca gatcgtttca 
147901 catataagac aatgctttca aataaagctt ttatttctgt catcttctta tctacaaatg 
147961 catgggggaa taggatattc atgcatcaat ttagctaaaa ttagataatt agtatttgaa 
148021 ctcagaaatc attcaaacct aataatttag ctcgattatt tctaaactta gatattcaaa 
148081 tttcaaaatt atgatatata tcttgatgaa gttaaataat gagttggatg ctaaactaat 
148141 tacaacatca tgcaatatat ataactgagg caatagagat cttatatctg atgtgttagt 
148201 tgaactagca tgtcattcat cattgaactt tgtcaacaga gatattaacc ttagtaaatt 
148261 ttgaatatgt aattaaaaaa aattgaattg acgtgtatgt ttgcacataa tttagcctta 
148321 agaattaatt gttgtatttg aaataagttg agatatgaat aaatcattct ccctataaat 
148381 gtgttttgtg tgtggacaag acttggattg aagacttgga ttcaccctct atggtaaact 
148441 tttaaatttg ccatctcact aataaccaat caaagtgtca actagattaa taattaaaaa 
148501 atattaaata actaaccaaa atagttagaa aaataaatag tgtcaatttt aataatgcta 
148561 atgtctctat ccaatccata aactcaaact ctaaacccta aattctaaat tataaatcca 
148621 aattctatat cataaatcca aatattaaac tctaaaccca aattttaaat cttaaaccca 
148681 aaccataaat cataaaccca aacctataac ctaaacccaa acctataccc taaacccaaa 
148741 acataaatac aaaacccaaa ccctatactc taaatatttg gattaatatt gatgttgttc 
148801 ttttgaaatt taaggttaag aattaaagtt tgggttacgg tctatagggt ttgggttacg 
148861 gtctagggtt tggatttagg gtctacggtt tgggtttaga gtttatggtt tgagtttaga 
148921 gtctaggatt tgggtttagg gtctagtgtt tgagtttagg gtatagtttt gggtttagat 
148981 ttagattttt tggtttataa tttatggttt gggtttatgg tttaaaattt gagtttaggg 
149041 tatagagttt gagtttataa tttaaaattt agagtttgga atttggattt aggtcataga 
149101 gtttgagttt aaggattaaa ttgtgatatt agcattatta aatatgacat tatttatttt 
149161 ttaattaaca aatattaatt tagttggcat tttgattggt cattaggaaa gtggtgaatt 
149221 taaagattca acataggggg tgaacccaat ttttatcttt tatgtgtgtg aatgtttgat 
14 9281 gaaacatata cattacagag ttgaaaaaaa agaaacatat acattacaga ttacagtagg 
149341 tttatgtggt taaaacattg atttatcact gtgcgtaaaa ggtctgtcac aaaagtgtac 
149401 cttttctttt tggctcaaac ttttcctttc tttttatcgg aaatataacc gttagaatag 
149461 aactcaagta aatgaaataa gtagtacatc tgcttttaat gtttgttatt attttagtcc 
149521 taggcctcgg catttcggta tcggttcgat tcagatattt cgaattttag gtagttcgga 
149581 tagaaggtta gaggattcac tgagtactta acttcttttc gattcggttt agttcggata 
149641 gtttggggtt cggttcgata gtaaaactag aaaccaaaaa gtacccgaaa taattttggt 
149701 tctcatttgg ttccagttcg gattcggata gctcggataa atattagtta tttaggataa 
149761 aatatcaaat aattatgatg atttaaataa aatgttagaa tattttggat tattttggat 
149821 atttcggata aaattatccg gatacttttg gttaatttgg atatttcgga cattttataa 
149881 taatgtagtt atcttcaaat atttttgaat tcttttaaga aattttttaa ttataaatat 
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149941 atatatatat . acatttagtt atgttatatg tatatataat taatattttt atatatttgg 
150001 gtatccgctc gattctcggt tcggttccgg ttcggttatt tcggatatag aaatatagac 
150061 accatacgga tatttgaagg tcctcggttc ggttcgggtc cggatatttc ggtgcggttc 
150121 ttcggttccg gtttttttgt ccaagcctag ttagtccttc ttgaacattt accaattact 
150181 agttccaagt acaaaaatga atatagttta attttcttca cacactaatc ttgttaacta 
150241 gtaacgtaaa gactcttata cttcccatct atgtcatatt acgtgttgac acatgcacat 
150301 ttaatacgtg tgtatacatt tatatatcaa caagttaatg tgtatatttc aatagtatat 
150361 atttatagac tggtgaacaa aataatataa tctatagagc agccgatccc aatactgcat 
150421 gttctttctt cttttctttt ttcatatttt gctatcgcgt ctatccgtct atggttggtt 
150481 atcttacaag tatcttttta gaaattatta tatcaaacta cgcttattct tcagatgata 
150541 aacgaggaaa taaaatacac tggaatatat tactaacaca atagtacaaa aaatgaccac 
150601 gtacttgttt tagtagttat attccaaatg ccaatgcatg ttcacgaaga atctggcaac 
150661 tttatttagt cctaatgagt tatgtgcgtg aatgaataaa tcattatgag atctacaatt 
150721 ctacatgcgt tttttgtatg atcaaatatt atattatatg aaaaatagac accaaatatt 
150781 tatacatata" atttatatag gtttcaattt tttaactafcg acttgttaat tgtftattgtc 
150841"" gtagttgaga tggtgacaac gactgttttg gacttttaat tcaccattca cgaacttttg 
150901 taattatcgt ttttgcatag aacaaagcat tggtcttttc aaatgcgtgt attttactgc 
150961 tatcggagtt gtatatcaat aatgtttttt atgaaaaata tatttaaata aaactataca 
151021 tcattgaaaa tgttattatt tttctttact atcagtcagt ttcttatttg cttatttatt 
151081 ctgttatttc actagtgaca aattcaagtc ttttgcaaac gtgtaagttt agcttaagtc 
151141 tttttttaag tctaagtcaa ttaagaatta taacttatgt attatgttag tacaatctaa 
151201 tcgtatttta taaaaatggt gtataccatt aatatgattg gttgactagt aacggtatac 
151261 gcgtggttgt ccacttatct taagctacta aaaaaagtta ttaataatca aattcatcta 
151321 actttacacc aaagtttctc tccgccgtcc aatagaatat tactatttat gccaagttgt 
151381 cataatacat acatatgata tattacaatc gtttgtcaat tgtcatacac gtaaattcag 
151441 tagacgaact attttagata gtgtgccgac aaaggtatca cgacgtatta catcttctca 
151501 tcatttactt tatttttgta atatcaaaac acgaaaaaat cgaccaatat ttccaacttc 
151561 cccaaatata caagcaagac aaatgcactt tttagaataa gaaactcatg cagtcatgtg 
151621 aagcaaagtt gaatgcagaa tatgaagttt accaagcaat taattaagat gttatcatac 
151681 aataagttac atgttagaca caatataagc tctaaattct taaggaaact ggagcactgt 
151741 ttatatatat atatatatat atatatatat atatatatat atatatatat atatatacag 
151801 tattgcctaa ttaatgagag tttctcattc tataaactaa tacagtagaa cctctataaa 
151861 ttaataatct ataaattaat aatctctata aattaataaa ttttagcggt tccaagttgg 
151921 gccggtgtta aatataacac aaatcgataa aataataaga taataatatt tttaaaattt 
151981 ctatgtaaat atatagtctc cttaaaatca taaattaata atttatctat atataatttt 
152041 atataagtta aatattatat tgttggtttt atattcacaa tgaaaattct ttacttttct 
152101 taacatttca atatattttg cataacattta gtaaaattat accaaaaaca acttaacaat 
152161 tctatgaaac ataaaattat acataaaata agataataaa acaatattga agtcaagttt 
152221 ataaaattta aataatgtac atacggtaaa ataaaatatt ttctttttta tataaataaa 
152281 aatagaaaaa tagaaaaaat ctaaataaag aaacttttgt aaattaatat ctctataaat 
152341 taataaaatt tcaaagtccc aacattatta tttatagagg ttttactgta taatgttata 
152401 ttacgatata atatgtcaag attaccaatc ttgacatatt atatagtaat atgttaatta 
152461 atcaaactac attgcttata aaaacaccaa cgtattaacc atatcttgta acacgtatta 
152521 tctaaagact gtgacatggt tgagttttag ttttattaat attgtgaatg gcagtcgcat 
152581 gtagtaatgt acacgatacc cacgttaatt aagtcatttc ttggtatttt atgatacatt 
152641 aagaaaacag tagttatatt acttatgtac "atttagcatg tgtacctcac gtgttatgaa 
152 701 atgagccggg taaatgaaga aacgaggctc aagtggaaaa gggaccacaa agggtgttat 
152761 gtattaaacc aactcaactg taaatggcgg tggtcgtttc aaattaaagt ggccatgtga 
152821 ttttattttg ttccgtcagt gaccaagata ctggacctgg ttaacataag aagaatcaaa 
152881 tagaactctt tcaatatact agaacttaaa aagaaataca aactagactt cattgcaatg 
152941 ttgatttgtg gctggtgctt gttctgttta gtttgaacat ctacagctaa tagaaaggtg 
153001 tctatcattg ctcaaagtgt tacaaaaaag ttttggatgc agttatatat atgtagctac 
153061 cggtgaccat tttagctgct gccttctttg accatgagag ggggatattg aagggaaccg 
153121 acccattttt tcgtaccatc agtacaagtc tggttcgact tcctcgatta gatttaggcc 
153181 gttttccaat ttttctatat tttaaattat ttccgagttt tagttcattc tcacacaagt 
153241 caaacaacca taatacgatt tgaatagata atcagagatt cgagctaata agcgagtttg 
153301 ttataagcta ttacatgtta aacccgttct ctatctaccc aaatccacca tcttttcttc 
153361 ttgccttaat cccgcgattc aattatgtct ttacctttct tgcatttgta actaaatctt 
153421 ttatttaagt ttgtaaactt ttattaaaaa ttaagattga aatcaaacct ttatttttgt 
153481 tatatatttg agatccatat attgtactgt tttataatca catatataat tcttcaataa 
153541 taatataatt ataagtacca atttgatttt aataaaattt aaattagtaa taaagtttca 
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153601 acaaccaaat tttctaaatc gataattcta cagccaaaac ataaagtgaa aaatcttacc 
153661 tgtcaaaccc aaattaaaac aatagtcaaa gtcttgtctt aaacaatcag agcctaaatc 
153721 atttgtaatc ttactctatt ttttattatt ttataatata gtaaaaatta gattactata 
153781 atgaaatact atttattttt .tactcattac tctagttttt ttactttaaa acaaaataaa 
153841 gttatagtaa aattctaaca cctttataat ttgtaaaacc aaaataatat tttacacaag 
153901 aggtgctctg gtacgaaggt cttttcagag tgttggatag tttttctgct tgtgctcaag 
153961 cgtctgtgca gcaacatggt ctgttctgcg tccttcggga tatgcctctc tacttgggca 
154021 tgtcgtttta tcctacccaa tggtttcatt tggtcggtgt ctatgtttgg ttctagtgct 
154081 ctacttatat ttttgagatc cttacatcgg gtgtaatatt cgagtaaggt tagttcatgc 
154141 atgcaaaccc aatcgtggac tagagcagtg gattagtcgg atcagtgttc tcgtggcttc 
154201 ggtaaattgt gaacaatata ttaagaaact ttatgatccc taatctcttg ttttagtaat 
154261 gttaagtggc tgaagtgaag tctttcaact gacatcattt gttgctaatt catcctagtt 
154321 agtagtttgt tgccgtatct ttctggtctc gttgtttaga tagtccgtag taatcgcgta 
154381 tagttctttt gttcaggttt ttgtctcttg aatgttcttt ccgccaactt tctggctttg 
154441 tctttctaac tttgccgattttctattgta actcatgttt tat ttcttca~ataaacaacc 
154-501 aaagttgtag tagtccagtg ataatttttt tggagctcag - cgtgtatcca tatttgttct 
154561 gagtttgata cttgttggaa actaagtatc actgcttagc cagccagtgt ttgggtttcg 
154621 accaaagtat cactgtttag ccagtgagtg tttgggtttc gaccaaaatg atttacaatt 
154681 ggtctatccc taacaataag gaatgggcac aaaccaaata cttgctgatt ttcaaatgct 
154741 tgatacttgg cttggttcgt tcaaataatc atattcacga tttggaattt acttgtttaa 
154801 aacaaagtac ttactatatc aagtacttaa gagaaaaata gtacttgcaa gttacttgat 
154861 ccgatttatt. gttaaaatca agtacttgtt ttaaaactat ttattacttc aaaactgtaa 
154921 gtaaccaaaa aaaaatctta tactccctct gtttttcaat atatgatgtt tttattgaat 
154981 gcacaaagat taagaaattt acattcttgt aaaaagaatt ttaaaatata tgattttaaa 
155041 atcattcaat caattataaa atttttattg gttgaacagt tttcaataaa ataaaatttt 
155101 accttaaaac ctcaaaatat tatatatttt gaaacaacat aacacttcta aaacatcata 
155161 taatttgaaa cggagggagt aaaaaataac tagacgaatt ttgctatctt tttttaatca 
155221 aaaaacttat tttatcaaat ctaatatgtt tatctgattt ttgttacaat aaaagacaat 
155281 atatttttaa tagataatta aaaattctag aaacaagtag cgaataatat gtaaacaata 
155341 agtatataac aagttatacg gcttattatc cgatttaaca atgtcaaata cttgattttt 
155401 aaagcgaaac aagacaaaca agccaagtaa aaagaatcat ggctcacata gtttttaaat 
155461 ttaagtggat taaggttttg aaactgtaaa aacttatgcc aatgttgatt aaacaaaaag 
155521 atatatttag gaacgagtaa caaatccaaa caagtcgatt aacttttttt ttaaagactt 
155581 gaaatttggt ttgtatttgc aagtttttta tttgaagtat ttgttacttg accatcactg 
155641 cgaatacttg attttacgga gaaagtacaa gacgagttgc gagtaaaagg cgaatttcca 
155701 gtatttgtgc ctagttatac ttaacatcag ttggaaagta ctccaaacct aactaaacag 
155761 cgtagtgaca gcttcgcact aaatccattt tttacaataa gtacattccg actagaatag 
155821 gaatcgaccg gatagtgaaa atgaaaaaaa aaaagaaaag aaatctccaa taaattccta 
155881 agatgacaag atacaggaat caaatcaacc tattatctgt taccatttta agtgggctat 
155941 tttggtgggc ttggtcatta ttacgccaaa cccacctcgt atcactaaca ctcgaaacgc 
156001 accgtattgt cgtttggcaa caaaaaaaaa ctcttatccc cttcgaaaaa cctttcttca 
156061 tcacactccc tctcctgtct ctccaatggc cttcactttc ttctctcctc accccgtctt 
156121 cctctctcta ggtcgaacca cctcttcctt ctcctcctac aaaccggcat actcaccatt 
156181 ctcccgaaat ccccgtaata atcttcagtt agcggcgggc cccacccgtc gtagctctta 
156241 cccgaacccg gcggatgacg acccgcccga agccccggag gattcgatgc acggcgtctc 
156301 caagtttcag cagatacagc gccaagccgc tcgagcgcgg aagctggagg aagaagactt 
156361 cgagaagaac cgaaacacgt acctctcggc catcgctgac gtggaagacg cgcccgagac 
156421 gggacgcgat gacgtggagt ctggaggcga tctgttctcg gacatcgata gagccafctc 
156481 aatgaaacgt agcgagttcg ttaaaaaagg actgctccaa cctaaccctc ccaaaacggc 
156541 gtcgtctaag aagatcgacg aggaagaaga agaagacgac gctgttgatg agcttgacga 
156601 agaagaagct gtggatttag acgagatcga taaactgact ggattaaccg aagcttccga 
156661 cgaagaagac tgggtcgacg aggaagggaa ccctaggatc atcagcaaga agaaggagca 
156721 tcaattcgag ttcgatttgg atgatttcgg cgaatccaag gcgagaatcg tggagcctaa 
156781 gttcagactg agcttagccg agctcttgga cgagagcaaa gtggtgccga tctcagtcta 
156841 cggcgactta gacgtcgaga tcaccggaat ccagcacgat tcgcgaggcg taagcgccgg 
156901 agatctcttc gtgtgctgcg acggaggaga cgactccgtc ctgagcgaag ctgacaagag 
156961 aggagcggtg gcgg.ttgtgg ctagcaaaga gatcgatatt gaagatacgt taggctgtag 
157021 agcgctcgtc atcgtcgaag acaccgaagc agtcttggct gcgttagctt cttcgtttta 
157081 taggcatccg tcgaaggaca tggcggttat cggagtcacg gggactaacg ggaagacgac 
157141 caccacgtat ttgattaaaa gcctctatga agctatgggt gtgagaacag gaatgttcag 
157201 cagtgtttct tgttacgtcc atggagataa caagatggat tcaacgacga cgagtcctga 
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157261 tgctgttttg gttcagagta tgatggcgaa gatgttgcat aatggaaccg aagctctggt 
157321 tatggaagct tctcctcaag aactcgcttc agggaaatgc gacgaagttg atttcgacat 
157381 tgcggtcttc acgaatttag ccagagagga tagtggcttt cgcggtactg atgaggagta 
157441 tagagatgct gaagccaagt tgtttgcaag aatggtcgac ccggaaagac acaggaaagt 
157501 ggttaacatt gacgatccaa acgcagcgtt tttcgtccag caagggaacc ctgatgttcc 
157561 tgttgtgacg tttgcaatgg agaacacgaa agcggatgtt cacccgttga agtttgagct 
157621 gtctttgttt gagacacagg ttttgctcaa tacacctcag ggtatactgg aaatttcgtc 
157681 tggtttgtta ggacggcata acatttataa cattcttgct gctgttgctg tcgggatagc 
157741 tgtcggagct cctcttgagg atattgttag aggtgttgag gaagtcgatg ctgtcccggg 
157801 gaggtgtgag ttgattgatg aggaacaagc ttttggtgtt attgtggatc atgctaacac 
157861 acctgatggt ttgtcaaggc tgcttgattc ggttcgagag cttaagccaa gaagaatcat 
157921 tactggtata tacatcttac gtctatttaa tccatatcca ctagttctta tggtgtttat 
157981 atgtattgtc gtctgatttg ttaaagttat tggctgtgcg ggtgagactg agagagggaa 
158041 acgaccggtt atgacgaaaa tcgcaactga aaagagtgat gtgacgatgt tgacatctga 
158101 taatccgggg aatgaagatc catgtgagta aatctgagat tagctcgagt ctataaccct 
158161 tgatgtgtaa caggtagaaa actctttaaa tgtgatttca gtggacatat tggatgacat 
158221 gttggctggg attggatgga cgatgcaaga gtatctgaaa cacggagaac atgattacta 
158281 tcctccattg gcaaatggtc atagactctt ccttcacgac attagacgtg tagctgtgcg 
158341 ttgtgctgtt gcgatgggtg aagaaggtga catggttgta agtatttaaa • taaacccatt 
158401 ctctgtacac cttttttact ttccctggtg tgttattaag gttgctggat tgaaattctt 
158461 taaagtaatc tcagcttcta agttacatta tgcaatggtg tttcttgaca ctgatttact 
158521 ttctttgcag gtggtagcag ggaaaggcca cgaagcgtat cagcttgaag gtgataagaa 
158581 agagttctat gatgatcgag aggaatgtcg ggaagcatta caatacgttg atgagcttca 
158 641 tcaagctgga atagacacaa gcgagttccc atggaggtaa gtagaagaga aacagcctct 
158701 tttgccaaat gattaaacag agattctgat ctgtgagagt tactgaaaca tgtcaagatg 
158761 cttcctctga tcacactaaa tgttctttcc tcgtattgct ctgcttttgc aggttaccag 
158821 agagtcatta atatctctct atggtgagat cattaaacca tgaatcagat ttacttccac 
158881 cgccaagata gatttgcaga gagctcatga attctgtacg agagacttgg aggtgtttaa 
158941 caaccctttg tgagtcatca ggtaagtgta ggggagagaa gagattctaa aggtggagga 
159001 agatgtgtac aaatttttgt tagtggttgt tgcgtttgtc tgtgttgagg gtttaagata 
159061 tggcaggtgt tttttttaat aataagcctt cattggctcg aaattaaatt ggatgagaac 
159121 ataaaacaaa atcaacacaa atgagtagga cataccacac actgcaatga gaaggaataa 
159181 ggcaactcca tactaattag caagacctaa ccaaaagaaa accctatggc gaatatggaa 
159241 ggatggtggt tgttgtttca aggtagagga ctaaggagtt gcagaaggag gttctgattg 
1593 01 caagagggaa gatggagcta cgaaaagaag caaggtcata gccgaacgga gttggatgtg 
159361 aattcagtgt taaccgaaat tggatgtgga atcattgtta accggagttc gagctggaga 
159421 gagtgcagct ttcttttttt taaccacgag ctggagagag tgcagctttc tctgctcgtt 
159481 ttcttctttg ctttttctgt ttttaaaaat ttgatgaaat cacagaacaa gtgataaaaa 
159541 aaaatctatg aaactgttgt taattgtagc aataagaatc aatagagaag cgtagctaca 
159601 ataatcataa ccgatggaat tcacctacaa aataaaatca acaaggcctt gtaaatttgg 
159661 aaaagaaaag tgtctttcat tctttcattc atcctccaac tgataccctg caaaacaaac 
159721 acaccaaacc attcatttaa gaaagtttcc ctgatgctac agcataagct gcagctcttc 
159781 tccaaaatta taaaggtaag caaagaaatt tgaaaacaaa atgtgttgaa tgacataccc 
159841 acactcatct gcagatcctc aagcattgcc actgcccttt ctagttcctc tttactccaa 
159901 aaaccagtca gcatattgcg gatagtaatg gtatcaggat acacaccact tgaaatcatc 
159961 tcctggaaaa tgtctagagc cccattaata ttacccactt tacgaaaacc ataaatcaaa 
160021 gtgatgtaaa taattgcatc agcaactatc cctcttcgac ccatctcgca gaaaagctcc 
160081 agcccatcat. caacccttcc tgccttacag tagccattaa tgagtgtgtt aaatgtcact 
160141 acgttgggag agaagctctt gctacccatc gaaacaaaca tttgtgtagc ctcatctagg 
160201 cggctctgct tgcatagtcc atcgatcatt gagctatagg tgatagtatc tgggactata 
160261 cctctgtgtg gcatctcctc gtataattcc tcggcctcta aaaacttccc ttcattgatc 
160321 aagccgcaga tcaatatatt gtaagttaga acatcaggtt ccacaccatt gaaggggtga 
160381 ctagcatcaa gatccatctt actcttctgc atagccttaa acatttccaa tgcatctttt 
160441 agtttcccat tatcgcagag accgtccagc aaagtgttac aagtaacgat atcagggcac 
160501 acaccactag aaatcatctg ctgtgaaagg tctagagcag cattaagatc gcccaccaga 
160561 cagaacccgt gaataagagt gttgtaagta actgtgttag caactaatcc tcttctaggc 
160621 atctcatgga gaagttccat tccatcatct atcctcttag ccccacaata tccgtctatg 
160681 agagtagtga aagtgaatac gtccggagag cagcccttgg tagccatcaa ataaaacatg 
160741 tcctcagcag catcaagacg atcctgtttg caaaacccat cgatcattga attatatgtg 
160801 attgtattag gaatgatacc ccttggaagc atctcatcgt ataattctgc agcctcgaag 
160861 aacttgcctt ccttgacaaa tgcattgatc aaagcattat aagttacaac attagggctg 
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160921 atcttccttt ctaacatttc ttgcaacaac cgctgggctg cactccatct accagagatg 
160981 caaaatccac cgatcataca gttgtaggta actatatttg gaaagattcc cttgtcttgc 
161041 atttcaatga aaagattatg agaatcgcta tggcgtccat ctttacaaag gccatcaatg 
161101 atggcactat agataaccac attgggtttg atgtggctta tctcctccat cttcctcaga 
161161 agattcaatg cagacacagt gtcgcccatc ttacacatcc catctacaaa tgttccgtaa 
161221 gtaatctggt caggctggag accattttcc accatccgat caagcagagc tacggcttcg 
16128i acaactcgac cctcgcggca aagaccattc atcagcgtgg tgaacgttag gacatctggt 
161341 ctacaaattt gatgaaacaa atccaaggct tcagaaaccc tgtgatcaag acataatccg 
161401 tggagcaggg tggtgaaggt aacaacatca gggtggagtc caagcttggt gagcttacca 
161461 aatgtagaca aagcaaaggg gagcttagag caactgcaga aacattttat cagaatggtg 
161521 aagctgtata tatcacatcg aatctgtttc ctttccatct tttgatagag agaaatcaca 
161581 agatccgggc gttccattct caccaccgca cccattagct tgttgaaatc aatcacagaa 
161641 ggtaaaggac gagatcgaag catgtcactg aacaaatcaa tcgcatcctc taaccctttg 
161701 atttcataag atccacttcg cagtttcaaa ctctctcctc taaaacctgc ttcgccactc 
161761 tctccatccc tgcttttctc ggccagagca tgacgaatcg atcccgCaca gaaaaatcta 
161821 gccgcagaca cagaagacga agaggaagac tcgaatctgc aaaccctagc caacatttcg 
161881 cctgagatta ccaagaagcc aacgacgagt gagtgaggag tggtcaagtc gtgaactttt 
161941 tggcttgagc gactataacc aaatcactga tttgaaatca gaaaacaatg accgtgtcaa 
162001 ttcggttggt taaataatcg gttcaaattg attttacact gaccggttca actcctgttg 
162061 ccaaaacggt ttaatcaaag ttggttcact tgggatgaac cgggaaaagt gcctattcca 
162121 acctgaacaa tttggatcgt gccaaataca acccgaacta atggttagtg ccaaatacga 
162181 cctgaactca attaaaactt taaaaaccta cccgaacttt ttaaaacgtg cataaatcta 
162241 cattgaaata ttattaaaaa gttgaaatta cataaattat tattttttca tcgtattaaa 
162301 ttttaaaact ttggtgataa ttttttctga tttatataaa tacataacat gtttttactt 
162361 attatatctt caataaactt gtatattact aaaatataaa taataaaata tttataatta 
162421 aattatctta gatatactta aaattttaaa gttatttata aataaaaaat gtatagatta 
162481 tttttatttt ttaaacttaa gtgaattttt tttcaaattt attattagat attttaatat 
162541 tttgttcaaa atttttgaag ttatttatat tttgaacctt ttacctttca aaattgaaca 
162601 aaatattaaa atatataata ataacttttg aaaaaaatca cttaagtttt aaaaataaaa 
162661 ataatctata caagaaattt ttataaataa cttcaaaatt ttaagtatat ttgagatcat 
162721 gtaattataa atattttatt atttatattt tagtaagata taagtttatt gaagatataa 
162781 taagtaaaaa catgttatat atttatataa attagaaaaa attatcacca aagttttaaa 
162841 atttaatacg atgaaaaaat agtagtttat gtaatttcaa ctttttaata atatttcaat 
162901 gtagatttat gcacatttta aaaagttcgg gtaggttttt aaagtgttaa ttgagttcag 
162961 gtcgtatttg gcactaacca ttagttcggg ttgtatttgg cacgatccaa attgttcagg 
163021 ttggaataaa cacttttccc gggatgaacc ttggccagaa tcaacctcgt acccggtctt 
163081 actcaactct tttaggtaac taaccatttc tagaatcaga ataaaattgg ataaaaacct 
163141 aattattcat cagggaatta cacaagataa aaatggataa ttagttcttt actttgtaac 
163201 taaattttaa cccgcccttt gatggcgggt aatttttgtt ttaatttttt tttgagattt 
163261 ttgttttatg tttttgtaat tatatttgta attattaatt tagtttgata taattttggt 
163321 gactatatta aatatgtcaa gtacgtaact atatacttgt actatatgca tttcaaagat 
163381 tattttaaac tttatttcta aagtttacaa catattatat tttttagtag ctactaaata 
163441 taattagtca aaaatattcg tataaaaaaa atctgacttg gaatcgatcc gaaaattaaa 
163501 gttttatatt cagtgttttg aaacccgacc cagatccaag attgaaccgg taaatccggt 
163561 gatccaagat aaatcagatt taggttttat aaaaaataat attttaaaag tcaaaaatca 
163621 ctaaaacttg gaacccgatt agtgactaat agataacttc tactttttat tttagttttt 
163681 aattatgttt ttaaaatctg ttttgtataa cttttagtaa ataaattagg ttttttaatt 
163741 tttctatcga ccatttatta ataatatttt aattttatta gaaatagaat atattttgta 
163801 taatgtattg ttggtatatt ttgaggtttt tctattttgt gggtgcattg gttttacgat 
163861 ttataaatta tatatgcata ttctttcatt tggttgatgt aatcgtagat ggagtagatt 
163921 tttttttttc cgtcaagatt ttatattaaa taaaggccag agcccaaaag cttacaacca 
163981 aacgagcccg ctaacttcgg gcccaaatta caatccgaaa cgggcctaca gcccaaacaa 
164041 accaaaaccc tagcgctacg gaatgctcgg gatgtcagcc gccgcgtaga tctaaccttg 
164101 ggagaatcaa gatacgtgtt. cacatcctcg tcctcgagag acacgcgtca cgcaggccag 
164161 cacaccacca ccgcttcacg ctaccaccgg ggaaccacac ctcagttcac cggaacccaa 
164221 ccggagctca caaactccac acatcctcat cgcgaacact ttttcgatct aggatggctc 
164281 taatcggaga gagcatcaat cgtcgctacg agatctcatc ccccgtagtt tatcgcacac 
164341 accctagaga acgcttcaca tgcatcacca accttcccgc cggagacctt caccctcctc 
164401 cgacgagacc acggtcaacc aaacccaaac gaaccacaaa gagaagaaaa gaaaaaaaaa 
164461 aaaacctaaa ccctaataaa ctaggagccg aaagccggcg gcgcaagaca gtggagatca 
164521 tcacaccccg gagactagaa ccgatgacgg tggaaccgaa gaggcttccc cgtcgcggat 
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164581 acgaaggccg gcgtcgatgg agctaaggta gcctccatct cccagaagta gacccaaaca 

164641 aaagatgtct tcaccgcgcc atcaacccac caccgcgtcg ttgctccaag aacagaacca 

164701 ccgctgaggt ggctggccag agttcggaca aagcagatgc cggaggagat agagaggaaa 

164761 aacatcagac taaattttag cttaaagaaa acagacaccg acggcggcac ggacgctaac 

164821 gcaccgaccg accgccgggt ccaaaccacg attcactttt tctctctttt ctttctcttc 

164881 tctctccaaa ccacgattca cttttttgtg ccttgttcta gatggagtag attaagttgt 

164941 tgtattttta atgtaggttc tgactaatat tagttgaatt agttagtata taattagggg 

165001 aaataaatat aattatagga agcacaaaaa gtaatatgcc aaggaaaatc aaaggtaaat 

165061 ctataaaagg tattctgatt taattatatt gatagtataa gaatcgatac taaataccgg 

165121 tttacaagca acaatacgct tttagttcgt ttctttggca agaacagaat caaagcgttg 

165181 tgagaacaat cagcaacttt tgatgatggc tatactgcag cttttccaac acttttggtt 

165241 ccataaatat gtctcatatg aacttgaagt taagtgcttt ctcaagaaac acgcttcttc 

165301 agtgcctccc caataatcac agtccacaga tcagatcagg gtctgcaacc catgtttaat 

165361 attgttggac atcagggcat aagcttcctt gtcacaacct gaaaacacac tcgaatcaat 

165421 gatttgaaat agccaaatta aatagagaac gttaaggatt ataaccaatg gagttaagaa 

_165481 cttaagatca ctactgatga tttagacaaa agaaaggaca gaccctggat ttcaaataaa 

165541 ccgaacagag cttcagtaac gccatatgca accaatagag aaacaacaca ccaattttct 

165601 gtcattttcc tttttcagat ataactaaaa ctagacacat agaaagtttt aagatgtcca 

165661 ctctagtagc cgtgtgaaaa tactagtaag cttaaggtat aataagtatg agagcttttc 

165721 accactctaa caagttatat cgctgttctg attgtgataa agttattgct acccccaaca 

165781 aaaaaaccga gacgttctta ctcaaataat ggctgtatta tcttcttttt gcaatttcac 

165841 tcttaactct ccctgctacc atgaaaccgc atggatctgc attgtcaatc taaacaccat 

165901 gacgacctca aaatcacagc cacgcaaaaa caaagacggg ttcacaaatt acaaaccaac 

165961 tgattctagt aacgagaaat gaaggacagc acaatggagt ttcaaatgct tgcagcaatc 

166021 atattaatct caatgcactt tggattgtca catgttgggc atctcagtta cccatctaca 

166081 cgttgatcaa caagagcagt ttttctgttt gctaagaaat caacgagagc agtttagtcc 

166141 cagctctaat acaataatcc caatacgtta acgtgttcat tatccactaa aaactgaaat 

166201 tcataaaata cacaattgta tatatcatcc gggatgattc atccccaaaa ctccgactct 

166261 aatactaaaa caggataagc aagaccacct aatctaaata ttaagaagga aaaacaaatc 

166321 agaatctaga atcgcaggga aataacacag accctatttg cctattctca tagcttcatc 

166381 ataagcatca tatacatata tattcaaatg cgatgattac acaggttgtc acaagaaaag 

166441 cattgaagta ttgttctcac ttactttaac tcagttttcg tcaccttaca caatgcaatg 

166501 ttgactcttt ctttgatcca accactggga gtaatctctt aaaaaagtta gaccaagcac 

166561 tgaaaacaat tagaatataa catttgcatc tcctcaccca gagagatgac aacacaaata 

166621 cttcttcatc ttggacattg tgtgatgttt gagagagaga tgcaaaacgg tggggttccg 

166681 gtttataagt cggagccaaa ccgttttagt atcgattttt agttcatatg atcaatctct 

166741 actctctaac cggttttact atcaagaaag aaaaaagaga agcttataga gtttgtattt 

166801 ctcgtctctg gtttctttgt tgattaacaa aagcttaagc aataatagaa tgaatctgca 

166861 agtttattct acgtatggta aattgatttc atagcaattt ggagataaag aatgaagcta 

166921 agcaagtcaa gaggcgaacc agagaagcat aagcgacaac aacatttcca ccaagattaa 

166981 aagaacatga tctcctcctg aacccacatg ggagcagaca aggcaggaca gagccaatat 

167041 gaagaagagg aggaagatgg catcacagct atcagctctc atgaagaccg agcttgacga 

167101 taaccatggt gacagctaca gaacgggcag acacctaaac atcttttaac acctgaaatc 

167161 ttctggaagg tgtgcctctt tagttcagac caataatctt gcgtgaaaca agtgtatgga 

167221 atttaattgg aaatggtcaa atttattaag cccagacaat agacgcataa aaggctgtca 

167281 ttagaatagc agcaaaataa cttactaacc acacaatttt tcactatgat cactctattc 

167341 aacgatttga tcaaacatta gtagggtata catgcagcag cagaaacaaa atgacattga 

167401 agctctgctg cgcaaacaga agaaaatctt tgatcattca tcccccaaat gatagatcct 

167461 acaatgcaaa accaaaccaa tggttaattc cagaacgtgt atcatcaatc tgaagctaca 

167521 cacaaaagaa tgtatacatt cttgtttata taaaaaatac atagactgaa cagaaactta 

167581 cataccatac tcatctgcag tttctcaagc attgccactg ccctttttag ttcctcttta 

167641 ctccataaac cagtcagcat attgcggatg gtaatggtat caggatacac accacttgaa 

167701 . atcatctcct ggaaaatgtc tagagcccca ttaatattac ccactttacg aaaaccacaa 

167761 atcaaagtga tgtaagtaat tgcgttagca actatccctc ttcgacccat ctcgcagaaa 

167821 agctccagcc catcatcaac ccttcctgcc ttacagtagc cattaatgag tgtagtaaag 

167881 gtcactacgt ttggagagaa gctcttgcta cccatcgaat caaacatttg tgtagcctca 

167941 tctaggcggc tctgcttgca taatccatcg atcattgagc tataggtgat agtatctggg 

168001 actatacccc tgtggggcat ctcctcgtat aattcctcgg cctctaaaaa cttcccttca 

168061 ttgatcaagc cgctgatcaa tatattgtaa gtttgaacat caggttccac accattgaag 

168121 gggtgactag catcaagatc cttcttactc ttctgcataa ccttaaacat ttccaatgca 

168181 tcttttagtt tcccattatc gcagagacca tccagcaaag tgtcacaagt aacgatatca 
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168241 gggcacaaac cactagagat catctcttgt aaaaggtcta gagcagcatt aagatcgccc 
1683 01 accagataga acccgtgaat aagagtgttg taagtagttg tgtcagcaac taatcctgtt 
168361 tcagtcatct catggagaag ttccattcca tcatctatcc tcttagcccc acaatatccg 
168421 tctatgagag tattgaaagt gattaggttg ggagagcagc ccttggtagc catcaaataa 
168481 aacatgtgct cagcagcatc aagacgattc tgtttgcaaa atccatcgat cattgaacta 
168541 tatgtgattg tattagggat tatacccctt ggaagcatct catcgtataa ttcttcagcc 
168601 tcaaagaact tgccttcctt gacaaatgca ttgatcaaag cattataagt tacaacatca 
168661 gggctgatct tcctttctaa catttcttgc aacaactgct ccgcgtcgct ccatctacca 
168721 gagctacaaa aaccaactat catactgttg taggtaaata aatcgggaaa gattcctttc 
168781 tcttgcattt cagtgaaaag attttgtgca tcgctatgac gtccgtcttt acaaaggcta 
168841 tcaatgattg cactatagat tacaacattg ggtatgatgt ggctcacctc ctccatcttc 
168901 ctcagcagat tcagtgcaga cacagtatct cccttcttac acatcccatc tacgattgtt 
168961 ccataagtaa tctgggtagg ctggagacca tcttccatca tccgatcaag cagagctacg 
169021 gcttcgacaa ttctaccctc gcggcaaaga ccgttcatca aagtggtgaa ggttacgaca 
169081 ttgggcctac atgtcgtttc aaacatttga tgaaaaaaat ccaaggcttc agaaacccta 
169141 tcttccacac ataatccatg gagcagggtg gtgaaggtaa caacatcagg gtggagtcca 
169201 agcttggtga tcttaccaaa tgtagacaaa gcaaagggga gcttagagca gctgcagaaa 
169261 cattttatca gaatattgaa gctgtatata tcacatcgaa tctgtttcct ttccatcttc 
169321 tgatagagag aaatcacaag atccgggcgt tceattctca ccaccacacc catcaattta 
169381 cagaaatcaa ccacagaagg taaaggacga gatcgaagca tgtcactgaa caaatcaatc 
169441 gcatcctcta aacctttgat ttcatgaaac ccactttgca gcttcaaact ctctcctcca 
169501 aaacctgctt cgcaactctc tccgcttgcc ttggccagag tatcacgaat cgatctcgta 
169561 cagaacaatc tagccgcaga ctcagcagga gaagaagaac acttgaatcc acaaacccta 
169621 gccaacattt tatttttgtt tcgcctaaat tttgttcttc gtcgatctct aaaaaaaact 
169681 cgtgactgag aaaataaacg agagataaag tacaaacggg ttccatttgt tttacaagaa 
169741 ccggttcagt ttgaattata caatttccgg ttcagatttg tcaccagttg attgattgag 
169801 ttccttgttt aaacccacag ggccacatga ttcactaaac ccagagccga gagaaattga 
169861 taaaccgagc tattgaaacc agtagagttc cggttcagta tttattacta agccttcttg 
169921 tttagtccac ttttaaaaat caagacgacc aatcaaagac catttacaga ctatccactt 
169981 ctaaaattat taaacgatca taaaaattta tcaacagcca ggtgtgttaa ataattatat 
170041 tagaggaacc agttaatgac atttttattg aaccagttaa tgacattttt attgatgtta 
170101 taagctaact taaaatataa acaaagttta tttactttgg gagaaccttg gccatcaata 
170161 tcatccccta tttcattcaa ctcttaagta actaaacatt tgactcaaaa ctaaattgga 
170221 tatgtacata aacaatgagt tggacatacc acactgcaat gagaaggaat cgagaatctc 
170281 catactaatt agaaagacct aaccaaaatc taacctatta atttagggtt ctatttttta 
170341 tctactgttc agagtagggc tggacaaaaa aactgaatcc aaagaaccga accgaatccg 
170401 atctgcaaaa gtaataccaa atccgaaccg aaattaattg aatatccgaa taggtttaaa 
170461 tttttggtat ttaaagaacc aaaaccgaac ccgatccgaa ccaaaatatt ttggatatcc 
170521 gaatgtatcc gaaatagatt aatatacata tatatttaac tatttttaga ttgaatatat 
170581 attaaaaagc atctaaaata tatataatac ttttaaattg ttcaaaatac tagaaaatat 
170641 ataaaaacat taaaaagtac atgtataaat agttaaagca tactcaaaac accaaaaata 
170701 tataaaatat gattgatttt ctatccaaat atccaaatca aaccaattta catgttaagt 
170761 tttggtactt cgacacatat tattgaaatt tatatgtaat atattatttt gtttacaaat 
170821 ttcgaaaagt ttaaaatata taatgaaatt ttaaaatttt gaaaataatt taaacgggtt 
170881 atccgaaccc gaaccgaatc cgcaaggatc cgaaccgaac ccgaaccgta atttagaaat 
170941 aaccgaatgg ggctaaaatc tttgacccca aaaatccgaa atccgaatag actcgaaccg 
171001 aaacccgaat aggtacccga acgcccaccc ctagttcaaa gattaggacc acttagttaa 
171061 atatttaata tgacatatat gtttatttat attaaatctt aataatcaaa aaaaaaaaaa 
171121 atcaatttta acaatacatt tgaattttat tttagctata acaaaatttg ttgaaaaaaa 
171181 tctaacaaaa tcctacccaa cgttttaaat atttattata aaataatgta ttaattaatt 
171241 tttgcaatta gtaatataat aatattataa ttattgttat ttagaatttt atcataaaac 
171301 aatgaaaata aaaaattatg ctattttata aatttatcat tataatctat gttgtactca 
171361 aatagaggtg ttctaaacta aatataaaag atgtcaaaaa aaaatgaact aaatataaaa 
171421 taatatatta aattggttga ttaaatctat ttttaaaata cttttatgta aataaattta 
171481 tcactcacgg ttaaaacatt gtaagagcag ctctgctcgt tttctttttt gcttcttctg 
171541 ttctcaaaaa cttgatgaaa tcacagaaac tgagtcaagt atatgatcag gaacacagaa 
171601 caagtgataa acaaaatcta tgaatatgtt gttagtgata ggagataaaa cactacttga 
171661 ttccatggaa aacgtcacca acggcgcaat aagaatcaat agagaagcgt agctattctg 
171721 taatcataac cgatggaatg cacctacaaa ataaaattat ctagggattg taatttggaa 
171781 aagaaaaatg tctttcattc ttttatgggt caattagggg aaagggcaac ttttaagttt 
171841 gaattaggaa aataggccac ctcaaattta tattaggcaa ttgggcaacc tagtagagag 
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175561 gaaatctaac ctagataatt tttcccatta gattactttt aagaaatcaa tttttaaatt 
175621 ttaataaaca tttttagtat ttatactttt aataattaat aaatgatata tagtattttt 
175681 gtatgatatt aagcattcat tttaatcatt tttagtgtat aaatattaca tttaggtgtc 
175741 atatgtataa atttcataaa caggggcttg gaatgtacat gagcgtaatt gtttcttatt 
175801 cttcagttga agaaaaaatg atttttcttc ttcagttaat ttaattgtaa aggtggtggt 
1758 61 gtagaaaaga atgagagaga aaaaagaaga tgagaggaaa agaaggagag ggagatgatt 
175921 ttcgtagatg atggttggtg gtgtggaaaa gaaggagaga gaaaaagaag gaaagagaaa 
175981 aagaatgaga gaggtggtgt gacaaattaa agaatgagag atgtgatgtg acaaatttta 
176041 ttttagatat attttaattg aaatggcaaa gatgtaaata atgaatctaa caaaggtccc 
176101 aaaggatatt tagacataag tttacatggg caaatgcaaa aagttgccca attggctaat 
176161 ttaaacttga ggtcgcccaa ttccctattt caaacttgtg atttgcccaa tttgccaatt 
176221 ttttcttctt ttattcatcc tccaactgat atcctgcaaa acaaacacac caaaccgttt 
176281 atgcaagctt ccctgatgct acagcaaaag ctgcagctct tctccagaaa cttaaaggta 
176341 agcaacgaaa tttgaaaaca aaattgttga atgacatacc cacactcatc tgcagatcct 
176401 caagcattgc cactgccctt tttagttcct ctttactcca taaac.cagtc agcatattgc 
176461 ggatagtaat__ggtatcagga-tacacaccac ttgccatcat ctcctggaaa atatctagag 
176521 ccccattaat attacccact ttacgaaaac catgaatcaa agtgatgtaa gtaattgcgt 
176581 tagcaactat ccctcttcga cccatctcgc agaaaagctc cagcccatca tcaacccttc 
176641 ctgccttaca gtatccatta atgagtgtag taaaggtcac tacgtttgga gagaagctct 
176701 tgctacccat cgaatcaaac atttgtgtag cctcatctag gcggctttgc ttgcataaac 
176761 catggatcac tgagttatag gtgatagtat ctgggactat acctctgtgt ggcatctcct 
176821 cgtataattc ctcggcctct aaaaacttcc cttcattgat caagccactg atcaatatat 
176881 tgtaagtttg aacatcaggt tccacaccat taaaggggtg actagcatca atatccatct 
176941 tactcttctg catagcctta aacatttcca atgcatcttt tagtttccca ttatcgcaga 
177001 gaccgtccag caaagtgtta caagtaacga cattagggca cacaccacta gagaccatct 
177061 cctgtagaag gtcttgagca gcattaagat cgcccacctg acagaacccg tgaataagag 
177121 tggtgtaagt aattgtgtta gcaactaatc ctgcttcagt catctcatgg agaagtttta 
177181 ttccatcatc taccctctta gctctacagt atccggctat gagagtattg aaagtgatta 
177241 tgtccggaga gcagcccttg gtagccatca aataaaacat gtgctcagca gcatcaagac 
177301 gattctgttt gcaaaatcca tcgatcattg aactatatgt gattgtacta gggattatac 
177361 cccttggaag catctcatcg tataattctt cagcctcaaa gaatttgcct tccttgacaa 
177421 atgcattgat caaagcatta taagttacaa catcagggct gatcttcttc ctttctaaca 
177481 tttcttgcaa caactgctgg gcttcactcc atctaccaga gctacaaaat ccattaatca 
177541 tacagctgta ggtaaataaa ttgggaaaga ttcccttgtc ttgcatttca ctgaaaagat 
177601 tttgagcatc ggtatgacgt ccgtctttcc aaaggccaga ttaccacatt gggtttgatg 
177661 tggctcacct cctccatctt cctcagaaga ttcaatgcag acacagtgtc tcccatctta 
177721 cacatcccat. ccacgattgt tccataagta atctggttag gctggagacc atcttctagc 
177781 atccgatcaa gcagagctac agcttcgaca actcgaccct cacggcaaag accgttcatc 
177841 agcgtggtga aggttacgac atttggttta cacatttgat gaaacaaatc caaggcttca 
177901 gagatcctgt cttccacaca taatccgtgg agcagggtgc tgaaggtaac aacagtggga 
177961 tgaaaaccaa gcttggtgat cttaccaaat gtagacaaag caaacggcag cttagagcag 
178021 ctgcagaaac acttcatcag gatggtgaag ctgtatgcgt tacatggaac ccgcctcatt 
178081 tccatcttcc tatggagaga aatcacaaca tcgagccttc ccatcctcac cacaactccc 
178141 atcaatttac agaaatcaat tactgaaggt aaaggacgag atcgtaccat atcaccgaac 
178201 aaatcaatcg catcttctaa ccctttgatt tcgtgaaatc cgcttcgcag cttcaaactc 
178261 tctcctccaa aaccactctc tccatccctg cttttcttgg ccagagcatg acgaatcgat 
178321 ctcgtacaga acaatctagc cgcagacaca gcaggagaag aagaagatcc ggatctataa 
178381 accctagcca acatttttgc ttcgcctgaa ctttgttctt cgtcgatctc taaaaccact 
178441 ctcgactgag tggctagtaa accggttcca tttgatttac aacaaccggt tcaatttgaa 
178501 ttatacaatt tccggttcag agttgaaccc actcagcgcg tcggtcccgt caagctaata 
.178561 atttctttaa aaaaaaaaaa aattctctcc tctttgggca gttaacaata actaaactga 
178621 tcaaccaatg ttccaatgta acaaaaaaaa aacattatcc tataaaagta gttaaacatt 
178681 accctataaa agtaggttga aagattttca ccgaaatcaa gttaaataga acgataatac 
178741 cgactgatta agctggtctg acacattcta tgttatagaa ttttccaatt ctgactatga 
178801 gtaccaaaag agaataaacc aaaaaaaaaa gactttctta ttcgaagtaa ttgctggaat 
178861 gctggattag gtaaggtcac atcttccttt tgtaacttca ctctgactct cctacaaaga 
178921 catgttcaca aattacaaac caactcattc ttgttatgag aaatgaaaga cagcaaaatg 
178981 gagtttcaaa tgctttgcag caatcttatt aatctcaatg cactttaggt tttcacatgt 
179041 atcactgaaa ctgctttttc accaggtagc tacaattttt ctttgaatta ctaaccattt 
179101 cttggtattc ggcatcctaa ttctcatcaa caaagcaatc aacaaagagc aatcaacaag 
179161 agcagattaa tgccctgcta atacaaatac cagcagacca tcctgatacc ttaaaagtgc 
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179221 tcaataagtt attggcatta accaactacc taaactccca ttttgatgtt aaactaccta 
179281 atctgaaata ttcagaatga atttttggga atcctatttc cctcctgctc gaccctattt 
179341 gctcagagct cataattggc atcataatta tatccgcaat attttccagg ttacgtcaaa 
179401 tcaaccaagt tacaaagaaa atgatgaaca tattaaaact aataacaact ttaggaatct 
179461 atatttttat aagagctcag cttacttcag ttgagtactg atcttacgtc aacaagtgga 
179521 tcagatcttt tttgtcatag gctcatagca gacaaatcaa ttcacctcac aactcttctc 
179581 tcgagggtta agaaatcatg ttgaggaagc ttttgtccaa tctcccatca tataacatat 
179641 tagtgagtaa accaaaggtt gaagcatctt cagaaaaccc gcaacctctc atttctttga 
179701 tgagttctgc tgatacagaa atgtcaccat ttctaagatg tgctctgatt agcacattat 
179761 aggtacggtc atctcaaaga gacccatctt ctttcatttt tataaataag tcatccgctt 
179821 cctgtagtat agacgttttg tacccaatcc tgagattatc gtattgtatg ttataacatc 
179881 gggctgcaat cctttgaggc tgaggctatt gaataaatgt tgtacgtctt ccactttccc 
179941 agcctcgcac atcccttcaa tgataatgtt gtaagtgaag atataaactt ctatttgagg 
180001 ctgaggcctg aaaaaaattg gatattttca aatacttagg atattttgga taaaacatat 
18 0 061~~taggataat t ttggatgaat caaatatttt ataataattt agtttttcaa atattttaga 
180121 tatttttaat agatttttaa attttatata tatXTtttgt tatgtaatta tgttatatgt 
180181 atgtaatata tatatatata tatatatacc ctttcgatta tatattcaag tgttcgttcg 
180241 gtttcaattc tgttcggtta tttcgaatat ataaatttag caatcattcg aatatttaag 
180301 agttttagtc cagtttaatt atgggtattt tgatttggat ccgtccggtt cagttatagg 
180361 tatttctatt cagttccgat tcatttcttg ggttccaatt tttttgccca gaactactaa 
180421 ttagcaagac ataccaaatg aacacttatt atctatttcg ttactgaagc atctaagact 
180481 cactggaact tggctggaaa atatgtttac aaatgcgatc aaatatatat gcacaaattt 
180541 ttttaaaaaa aaactacaca aagacagcca tatacaacaa ctgacaacgt caaatattta 
180601 ttacaaaacc aataaagaga gacaataaac tatctctaag atcctgaaca taatttggga 
180661 ttctaaattc agattctata taactactag atcttaagga gaagccttga aagttagcta 
180721 agtgtgcctg attgcatcac agggaaggac caatcgctgc tgttcttggt cgaatctcta 
180781 tgctcatcat caccggtcaa atgtatctca tcggtggtgc cgaaatcgaa ttctttattg 
180841 gatccggagg aggaatgttg agctgcttct tgatcagacg cttcagggaa cgttgttctc 
180901 agcttcttat caacgcatct aatgaaatga tccgcatcaa gatcgaacga aaccggtttg 
180961 ttcggtctca caatcttgtt ctgctcttta ggagttgtaa cagacttgga gcaacgcagt 
181021 ttaccagggc tcagcagctt tggtggatca gagacttgaa actgagcgag tccgtcagga 
181081 aacggagaag tcgcaccaga accggacata accgagctgg gggagataag ctgagcgagt 
181141 ggactaccgg gaggaagttg gtaaaactga aactcgtagt tagacaccgg agacctgaca 
181201 ccgtagttac tgctggagtt aaagagctga gcgaaaggga cttcaggcga agacggtgtg 
181261 gtggtggtta agtagaaaga ctcgtcgaga ggcggcgtga ccggagctga agacggttcg 
181321 gttgtgtaag tagagaaaac cggaggagaa accagctgag gttcgtgagc gtaaggtccg 
181381 atggcgaaga tcgaaggacg ttcttcgctg ttgttgttgt tgttgtggct gttagaaggt 
181441 agaggactaa aggagaggat ccctacaggt gactgtgtag cggaaggagg ttctgattgg 
181501 aagaaggaag ctggagagga aggtggggct atgaaaggaa gagccgtcat aaccgaacga 
181561 taaccggaat tggatgtgga atcggtagga accggttcag gagcaagagc agctttcccg 
181621 attcgttttc ttctctgtgt tgaaggtctg aaacatatga aacgattcca ccatcttcgc 
181681 ttcttctgct attcaaattt tttttttttg attgaagtca cataaacaaa ttagaatcta 
181741 ttgtgaatga gtcaatagag aaacgtagct atggagttcg cctacaaact ttgttcgatc 
181801 taatgctttt gcgataacga aatcattata cagttctcta gatgatcact aaaacctcga 
181861 gaaaatgagc taaatccgat gaaataggga ctcacatgaa tcggggaagg ttggtgatga 
181921 acacgatcat cagaggaagc "gaacgcagta gcggctgcgt ttatagtctc caaaacgttg 
181981 tttccactcg cgccgcctct catcttcctc tgatcttata taactttccg gagaaaacaa 
182041 aacagagcat tcgaatttct ctcaaagtat caatacacag aaaaaaaagg actatgaaaa 
182101 tcaatgtctc ctgtgagtaa agttgaaaaa caatttataa ataacgaaga cacgcaaccc 
182161 gaaagttctg ggaatattgt ttctctcttg agaaaatata tagatacaca aaccagcata 
182221 cgtgtgtgtt atatgtatgt gtatgtgtgt caaggtttga ggtcaaaaca cggcaaagaa 
182281 caaaaggcgt gcctactatt cattttgtaa attgatgtga tttgaagggt aattaggtct 
182341 attaaacttg cgaagaattt gacttctttc cttttttgtt tttttttgtt tttttttcaa 
182401 attaatttta agataaattt aaatttatca gtttttattt cattctgatt taaagatttt - 
182461 cattttatcg tgaatatttc actacttttc tggtcactat ttcactaatt atagattagt 
182521 acaagaataa aatgagatta gcaaattgat ggtgctgttc gtttgctcat ctgggtgatc 
182581 catctaggtg aagatgcaag ttgatgtttg ttttgtacat taaaatgcta catccagatg 
182641 gatcacccaa ctgcatttat gaaaattcat ctcaaattct cacccaaatg agggtgagtc 
182701 ttcacggagc atctggatga aggtgagtct tcaccaaaaa tttgataaat ttttcgttaa 
182761 aacaaaataa attcttgcca aaaccgaaaa atgcaatttt ccgtcaaaat tagaaaacac 
182821 aatttctcgt caaaatctga aaaacataat ttcgccaaaa ccagaaaacg taatttcccg 
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182881 tcaaaatcga aaaatgcaat atcccgccaa aatcggaaaa aggcaatttc ccaccaaaaa 
182941 tcggaaaacc caatttctcg ccaagaccaa aaaacacact ttcacgccaa aaccgagaaa 
183001 acgcaatttt ccgtcaaaac cgaaaaacac aatttcccgc caaaaccgac aaacataatt 
183061 tcccaccaaa atcgaaaaac gcaatttccc gtccaaaccg gaaaaacaca atttctcgtc 
183121 aaaaccggaa aacgtaattt tcccgccaaa accggaaaaa cgtaattttc cgccaaaacc 
183181 ggaaaaacgt aatttctcgt gaaaaccgga aaacgcattt tcccgccaaa accggaaaaa 
183241 cgtaattttc cgccaaaacc ggaaaaacgt aatttctcgc caaaactgga aaacacaatt 
183301 tcccaccaaa accgagaaaa cgcaatttcc cgccaaaact gagaagacac aatttctcgt 
183 361 gaaaaccgga aaaacgcaat ttctggccaa aaccgggaaa acgcaatgtc ccgccaaaac 
183421 cggaaaatgt aatttctcgc caaaaccgga aaaacacact ttcccaccaa aaccgagaaa 
183481 acgcaatttc tcgccaaaaa cgaaattttc cggcaaaacc gaaaatgtag tttcccgtca 
183541 aaattggaaa aacacaattt ttcgtcaaaa ccggaaaaat gcaatatccc gccaaaattt 
183601 taaaaacaat tctattttaa ttattttaat aacaagttca tctcgatgta gatgcaagtt 
183661 gaaaaaagca aacaaacata gttgcattta gatgattcat ctggatggat aaacgaaatg 
—183721 cagagacgaa caacatctag atgaagtatt tggataaggc atcaagatag accatctgga 
183781 tgcatttttg agatgtacaa acgaacaggg ccatattctt acgaacataa agattgagat 
183841 gtttttgttt tattttttat ttttcatgac gagataagtg tcacagaaac gcacgtgtgg 
183901 gagaatcgaa tctatctctt ttagttttag gtttctcgtg tgcacattag tgtggggcta 
183 961 aactaagtaa tttaaaatta ttaaataaat tatatatttt gaattagaaa tgatgtcaca 
184021 acttctggcc cctaaaaaca ttattatctc acaaagccaa tttcaacttg ttttgcacgg 
184081 tggcttgact tcatgttttg ggtcttgttt acttttctat ctctccatac ccaataatta 
184141 aacgaatttc ctataaatgc tttatggaaa ccataattca atcttttggc acattataat 
184201 ttaatcgtta aatagcccgc agaagctgat agccactttg taacttgtta gaaaatggat 
184261 tactgtttgt atttgtccaa ttgactgatt cttggttcca aaaaggggac ataacttatt 
.184321 gaatcattta cacaaaaacg gaagacaggc acagttaatg tgatttggtg caaaaatgtt 
184381 tagctgtctg tcaatcaaat aaatacgcag atctttccag attcttgaaa tttttcatat 
184441 tagttgggaa attcatttta atatcccaaa aatatagaat ttgaaagagt tttctgttaa 
184501 attataatag gtgagggtta taaaaacata attactactt aacattacaa taaatagata 
184561 acataaaaat gaagaaaaag atcaaaagaa ttaaaaagac gtttttaaat aagatttttt 
184621 taaaaagttt aagacattaa aaaattgttt tatcattaat ctaatttttg tatttaaatt 
184681 ttaattttaa tgtggagcta cattaatatt aatattatgt tatcaaatat ttttttagga 
184741 tcctttagta ttgatgttgt tagaattaag tctaaaaagt taactatgaa aaagatagaa 
184801 aaactagaaa atgatgactt ttactaccaa gtttaagaat acactaggtt ttggtccgca 
184861 ctttaaaagt gcgagacttt tttgttataa tttagtacaa aaaattcaaa gatgatattt 
184921 ataaaaacat tttattatta tattgaaact ctgatatgga ttgtcgatat caaaatcaaa 
184981 ataagttttt cctaccatat atcttagggg tgggcacttc gattatcttc tcggttcgat 
185041 tcaggatcgg gttggtttgt ttagtttggt tctagtaatt tctaactgaa gtaaatcata 
185101 gttagtttgg ttcggtttct cggttcggtt tagtttcaaa ccgaaccatt cgggttggaa 
185161 aatcttcaat cgaattattt gaaaaagatt tcggttcgat ttgaatcggt atcgggtcga 
185221 ttagttccgt ttgactcggt tcggtttgaa ttttttgtcc .acccctgata tatctaaacc 
185281 gtgcatttct tgaaatatct tatcgtggta accgatgatc catactttaa cagcgaggtt 
185341 attttgtttg tacattttaa tttgaaaata taaatatttt gatccgtgtt tctgaagcca 
185401 agtaatttat ggtttgaaat tatagtaaat ttacattttt ctattttatc atatataatt 
185461 tatgttaaga ttcacataga tattatattt taattatttt tcagtgagtt caaactttgt 
185521 atctttaata aaatttgaat atgcgataaa ttatcatttt ttccagctta aaggtggatg 
185581 agtaatcatt tattagaaat gttattttct gtcctattat tattattata aaatattgct 
185641 ttcaatttga ttttaaaact agaaatggaa cacttatttt ggaatttgta aaaataaata 
185701 taatagtata atatatagta tcgaataatt tcaaagttat tttcatataa taaaaatgta 
185761 tatgtatcgt tcttttgttt gaatttaact aattaggtat gtgcagtaaa gtgccatcac 
185821 taagttatgt tataatcata taaactccac ctatctatgt tagcaatgat gataatatat 
185881 attgaaacaa aaaaaataaa aatgtaaatt tacataggaa aaaaacgtat atgtactatc 
185941 acatatacaa ataaaaatgt gactatttaa ttcaaattct atgtgaaact atattaggct 
186001 gaatattttt tgttcagatt ttctaaagat ctcttcaaaa tattttaggt aatttaaata 
186061 ttaaatattt aatatatttc agtggcattt ctgtaaatag tttgaaaaac taaaggaatg 
186121 tttgaaaaat atatgctgtt ttaattgtat tgatttgtca tgttattttt ggtagtcatt 
186181 ggtaaacttt ttacttggtt actctctccg tcttcaaaaa aaaattataa agaaaatata 
186241 actaatattc ataattaaat ttattattta tttaaaaata tactttctga taactatcag 
186301 ctaacaatat ttaattaatt taaatattct caattaatgt ttttaaaaaa tatacaaaat 
186361 agctttaaaa tatataaaaa gtctattttt gtgaaacaaa aaaaaatcta gaaaaattta 
186421 ttttcaggga atagttttta ggaaaaaaca ctttctagaa aactatagca acttcaatag 
186481 ttttctggct taatactttt aatataatgt tgtctgaaac taaaaatcaa accttactga 
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186541 aattataata agctgaaaca gtttacaaac caaaaaatca acatttagtt ctcatgtaaa 
186601 agtaaatttc cggcctaaga aactgattta tccaaaagta attctatact actcccacca 
186661 taataatata tgtcatattt ggtttcaata tttacaaacc tgtaactttt ttaagtaact 
186721 tggacaatag agagatgtaa gcacatatat ttttattgaa atgatctttt gttaaatatt 
186781 ttaccatcta tttataaaca aatgtaaagt tatttagttc gtagactcgt agttatacca 
186841 ggaaaatgaa tttgagcttt gtgaattaga tttatgtcgt tatctgaaat gaaatagaaa 
186901 gaatatcatc gttaaagttg agcacattag ttagcaaaag aaaaaactga gcacattatt 
186961 gtccattttt ttcttttaag acttgtcggc agttcaatga gttaggtcat caaccgcact 
187021 tccggactcg tgctttgaca catgtcacca acccaccttg cactcaaaga cacaacacag 
187081 tttaaaaata agagatcatg aaactgggat acaattatac tcttattcaa ttttgataat 
187141 tgtactctta ttcaatattg acaacaatta gtttcaaaaa aaaaatttga caacaatttt 
187201 ttttttgcaa tgagataata agatgacacc agctttcacg tgcatcaaag ttggatgaaa 
187261 aaggacaaga agtttacgtg atatttcttc caaagtaaaa gccacaccaa aataaaagtt 
187321 aagaaccata aaagtcaaat taaaataata tactccaaga ttcgaactta tcattaaaag 
1873 81 ttcctrtftact gcaaatttac aaagcaagca acagcagatt caataaaggt tcaaatttaa 
187¥41 tggcagggca tcaggtttgg tgtagaaaat catttggcaa gatagactag aaatgataag 
187501 aacagaacct gaatttcaaa gagaaggtac gagcttcagc attgtaagag gatgtcactt 
187561 tgacgactgg cgtcccagct tgagggtacc taaaatatca atattgtata tccttattag 
187621 aatgaaaacg ataatattca caagacattc cacagtagaa atatgactaa ccattgcaag 
187681 aaattagcaa aatctgcatt gtttgcatca cgcatagcag caaagaagtc ttcacaggtc 
187741 acagcttgct catcgtgtct ttgaaaataa agatcaattc cctgtaacat atgcacacac 
1878 01 agaagtgagc gtaaggacca cgaaataaga cctagatact ggatacacaa aaaagagtat 
187861 tggaaatcaa cctttcggaa accctcactt cctagtagag ttttgtacat cctcacaacc 
187921 tcagctccct aagaaaacat tatatagtca ccttcagaga ctcggtatat ttggctgtgt 
187981 cattacatat caatttatgg cgcaacctta tctgggaaag ttggggctag tgegaagtga 
188041 cataggtgtt gatagaattc cagactgcat taaagatctc caacggttaa aaaatcaaag 
188101 tatattttgc tgtccaaaac ttgcataact gcccgagctt cctaggtcgc tcacaacact 
188161 aagagtatat agatgtgaat cactggagac aatatagtac cgttcccttt aggttctgag 
188221 attgaggata tctatttccc tgactgctac agattgggtc gtgaagcaag gagagtaatt 
188281 acccagcatc cgttgcagga atgcctacct ggcatacaca tatacctgca gggttccagt 
188341 acctacctat ctgttcaaaa gcctacctat ttaagttttg tgtggtggtt ttccctaaac 
188401 atggaaatgg tagaacatag tggtattgaa ttactgtgtt acatacgcat gaatggttgc 
188461 cctacgaaga ggagaacatt ttcttttgat cttccaatcc aatcagaaca tctgtttata 
188521 tttcacgcca tagtgtttaa ggaagaagac ctacagcatg aacaatatcg ccagatatgt 
188581 ttcctgttca gcaccacatc ccaggaagtc gaaattattg aatgtggcgt ccagatcgtg 
188641 agggacagaa gtggcagaaa aagaagcgac gccgatgctg tgaatataaa tcacaacaac 
188701 aactgttgga agacgacgat gaaagtctct acggtagcct tgactatata tgatgcacca 
188761 agagtagcta ccattaagga ttttgctaag tctctttctt cactttttat tttcctattg 
188821 agtttgattc cgagtgttcc tttttatagc ttactaaagg aacataaaga cttgataagt 
188881 gattatttca catcattctt ttccttctga gttcgatttg aagtgttact tttgtgtaac 
188941 ttccttttaa gacttgggta tgtgatcttg tcgttgtatt tactcatcag acaatgttca 
189001 gtagttgtgt atagacttca gctacctact cgttttgcat tttgttaact tttcaaagaa 
189061 attctacgtc tgctgggaag atgttgacgt tgcccaaact gctcttgttt agctgttgag 
189121 attttcaagt gttcttttga atagctttta ttgttttaca ccttgactac agttattagg 
189181 aagtatacct gggaaagtgt cttcttcctc gtatatttag ggtcctaata tgttgtgggg 
189241 ctccatttag aacaaaacaa tagctgttct gttttaatgg tttatgaaaa tgcagaaacg 
1893 01 tctcaatgga atgatatgat catatgttgt ggacaggcac atacagagta catttatatg 
1893 61 aatcatgggc ttgttggaag aactagactt cattagaaag agatgaagtt tccttcaaaa 
189421 gaagtctcgc ctgaacaaga agaaagcaga acaagccaaa agcgacaaca acaaggatcc 
189481 ataaaaagtt ccagtttcag gtaataactt gcgtagaata agtgacaaaa tatttagttt 
189541 gaatatggca cacaagatta gacaagaaga cgttgacgaa aaaaaaaaaa aaaacaagaa 
189601 gaccataacc ggttgaatgc cggtttaaaa cttaacatct aaccggtaac cggactagcc 
189661 attctcattt gaaaacaact aataagcgaa gaaaattcag ctcaagtcaa cacactgagt 
189721 gaaaggactt taccatcttc acatctggct catcgggaga gactttcatc gatactctgc 
189781 tctcatggct tcttcttcct cttcttcctc gcctcgcaca tggagatacc gcgtcttcac 
189841 gagcttccac ggacctgacg tccgcaaaac cttcctcact cacctacgca agcagtttaa 
189901 ctgcaacggg atatcgatgt tcgacgatca agggatcgag agaggccaca ccatcgcccc 
189961 tgctctcaca caagcgatca gagaatcgag gatctctatc gtggtgttaa cgaagcacta 
190021 tgcttcttcc aggtggtgtt tggatgagct tttggggatt ctgaaatgca aggaagagat 
190081 cgggcagata gtgatgacca tcttctacgg agtagatcct tccgatgttc ggaaacaaac 
190141 cggagatttc gggaaagtct tcaaggacac atgccgtcgt aaaacagagg aagagaggcg 
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190201 aagatggagc caagctttga ccgatgtggg aaacatagct ggggaacact ttctcaactg 
190261 gtttgtttgc tttttcttca ctcttgtacc tttcttgatt ctcacatatg aacgtaatct 
190321 cttgttttag ggacaaggaa tcggagatga ttgaaaagat tgcgagagat gtctcaaaca 
190381 agcttaatgc tacaatctct agggattttg aagacatggt tggtattgaa gcacacttgg 
190441 ataagatgca gtctttgtta catttagatg atgaggatgg agctatgttt gctggaatct 
190501 gtggccctgc tggcattggt aagactacca ttgctagggc tctacatagt cgactctcta 
190561 gcagttttca tcttacttgt tttatggaga atcttcgagg aagctgtaat agtggtctcg 
190621 acgagtatgg attgaaactg cgtttacaag agctacttct ttcaaagatt tttaaccaga 
190681 atgatatgag gatataccat ttaggtgcga taccgcaaag aatgtgtgac caaaaagttc 
190741 ttatcattct tgatgatgtg gacgatctgc agcagcttga ggctctggct gatgaaacta 
190801 actggtttgg tgatggaagc aggattgtgg tgaccacgga agatcaagag cttttggagc 
190861 aacatggtat caacaataca tactatgtgg atcttccgac tgacgacgag gctcgtaaga 
190921 ttttttgtag atatgctttc agacggagct taacaccata tggttttgaa actcttgtcg 
190981 aaagaacaac agagctttgt ggcaaacttc cttttggtct ccgtgttcaa ttttacgcgg 
191041— aaagaaagaa gacgactggg aaagtatatt gcaaaggcta gaaaatagca atataccaaa 
191101 gatcgatgca gtacttagag ttggatacga cagtttacat gagaacgaac aaactctgtt 
191161 tctcctcatt gccatcttct tcaactacca agacgatggt cacgtgaaaa caatgctcgc 
191221 tgacactaac ttggatgtca gactcggctt gaaaactctc gcttataagt ctctcacaaa 
191281 aatatctagc caaggaaaaa tagtgatgca caagttacta caacaagtgg gcagacaagc 
191341 agttcaaaga caagagcctt ggaaacgtcg gatcttaatt gatcctcaag agatctgcga 
191401 tgttcttgag ccttggaaac gccaagtcct aactgatacc gatgagattc gcgatgtcct 
191461 tgaaaatgat tctgtacgtt cattttcttc tatctgttac accggttcat ttagcaaagg 
191521 cctatagatg ttaccacctt ttgttctaac ataatttcct ttattttgaa tttttttttt 
191581 agggtagtag aaatttgatg ggggtatctt ttgatatgtc tacaatctta cacgacatgg 
191641 atattagcgc aagagctttt acaagtatgc gtaatcttcg atttctcaag gtctacaaaa 
191701 caagatgtga tacaaatgtt agagtgcatt tacccgagga catggagttt ccacctcgtc 
191761 tgaggttatt acactgggag gtatacccga gaaagtttct tcctcgtaca ttttgtactg 
191821 aacatcttgt ggaactctat ttaagagata ccgagctcga gcaattatgg gagggaaccc 
191881 aggttggtta ttttatttta ttttctgttt gtgtgtactt tggatagtgt ttaatggttt 
191941 ttactggttt gtgaaatttt atataaatct aaatggaata tagaaaacaa tgaaccttaa 
192001 aaaccaaata tgtggaagtt acagtagcat cgtcaatttt gttaagagtt tcaaacatat 
192061 gtatgattcc catcatttca taacattaat atggtattgt gctgttttac attcagcccc 
192121 tcacaaatct caagaagatg tttttgggtt cgtgcctgta tcttaaggaa cttccggatc 
192181 ttgcaaaagc tacaaaccta gagaaattga ggttggatcg gtgcaggagt ttggtagaga 
192241 ttcattcctc tgttggaaac cttcataaac tagagagttt ggaagtggct ttctgttata 
192301 atctacaggt tgttccgaat cttttcaact tggcatctct tgaatcattc atgatggtgg 
192361 gatgctacca actgaggagt cttccagata tttctacgac catcacagaa ctctcaatcc 
192421 cagacacact gttagaagag tttactgaac caattaggct ctggtctcac cttcagagac 
192481 tcgatatata tggctgtggg gaaaatttgg agcaagtgcg aagtgacata gctgttgaga 
192541 gaattccaga ctgcatcaaa gatctccaac ggttagaaga actaactata ttttgctgtc 
192601 caaaacttgt atcactgcca gagctcccta ggtcgctcac attactaata gtatacgaat 
192661 gtgattcact ggagacacta gcacctttcc ctttaggttc tgagattgaa gctctctctt 
192721 ttcccgaatg cttcagattg gatcgagaag caaggagagt aattacccag ctgcaatcat 
192781 catgggtatg cctacctgga agaaatatac ctgcggagtt ccatcaccgg gttataggaa 
192841 atttcttggc catatgctca aatgcatacc gatttaagct ttgtgccgtg gtttccccta 
192901 aacaggtgat ggtggaagat gaagatatag aattactgtg tcacatactc ataaatggtt 
192961 gccccatgaa gagccccatt aagagcatat ataatcttag gattagaatc caatcagaac 
193021 atctgtttat atttccctcc acaatgctca aggaagaccg acagcttgga caatacagtg 
193081 agatattgtt caaatttagc accacatctc agaacactga aattattaaa tgtggtgtcc 
193141 aaatcttgag ggacagaaga agctgtgatt ctaagtcaga acaagacgac gatgaaagtc 
193201 tctacagcag ccttgactat gatgcaccaa gagtagatac cattaagtat tttgctaagc 
193261 ttctctcttc actttttatt ttccttttaa gtttgattcc gagtgttctt ttttacagct 
193321 tactaaagga acataaagac ttgataagtg attatttcac atcattcttt tccttgtgag 
193381 ttcaatttca ggtggttcct ttgtataact tccttttaag actggggtat gtgatcatgt 
193441 cctagtcatc tttgtatttt acatttttac ccgtcaagac aatgttcatt agttgtgtat 
193501 ggacttcagc tatctagtcg ttttttaatt ttgttcacgt tgttgacaag atgttgaagt 
193561 tttcaagtgc tcttttgtat agcttttaca cattgactac ggttattact tattaggaag 
193621 tatacccagg aaagtgtctt cttcctcgta tatgttgtgg gactccattt agaacagaac 
193681 aatagctgtg tttttttttg ttctttgctt gacttgggaa tggtttttga aacagctaag 
193741 cgacgaacat cttaaaagtc aacccacgtg acgtcttctt cttcacatat ggctcatccg 
193801 tagagacttt tttttttctt tgacatcgta gagactttca tcgctaactc taattctcat 
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193861 ggcttctaaa tcttcttcct cttcctcgtc tcgcacatgg agataccgcg tcttcgcaag 
193921 cttccacgga cctgacgtcc gcaaaacctt cctcactcat ctacgcaagc agttcaccaa 
193981 caacgggatt tcgatgttcg acgatcaagc gatcgagaga ggccacacca t tgctccttc 
194041 tctcgcacaa gcgatcagag aatcaaggat ctctatcgtt gtgtttagaa ctatgcttct 
194101 tccaggtggt gtttggatga gcttttggga attctgaaat gcaaggaaga gatggggcag 
194161 atagtgatga ccatcttcta tggagtagat ccttctgatg ttcggaaaca aaccggagat 
194221 ttcgggaaag tcttcaagga aacatgccgt cgcaaaacag aggaagagag gcgaagatgg 
194281 agccaagctt tgaccgatgt gggaaacatt gccggggaac actttctcaa ctggtttgtt 
194341 tgctttttct tcactctttg tatctttctt gattctcata tatgaactta atctcttgtt 
194401 ttagggacaa ggaatcggag atgattgaaa agattgctag agatgtctca aacaaactta 
194461 atgctaccat ctctagggac tttgaagaca tggttggtat tgaagcacac ttggatgaga 
194521 tgaactcttt gttacattta gatgatgagg atggagctat gtttgttgga atctgtggcc 
194581 ctgcaggcat tggcaagact accattgcta gggctttaca tagccgactc tccagcactt 
194641 tccagcatac ctgttttatg gagaacctta gaggaagctg taacagtggt actgacgagt 
194701 atggattgaa gttgcgttta caagagctac ttctttccaa gatttttaac caaaatggtg 
194761 taaaactatt tcatttaggt gccataaagg aaaggttatg cgacctaaaa gttcttatcg 
194821 ttcttgatga tgtggacgat ctgcagcagc ttgaggcttt ggctgacgat actaactggt 
194881 ttggtgatgg aagcaggatt atcgtaacca cggaagatca agagattttg gagcaacatg 
194941 gtatcagcaa tacataccgt gtggatttcc caactcaagt agatgctcgt cagatctttt 
195001 gtagatttgc ttttagacag ctctctgcac cccatggttt tgaaaaactt gttgacagag 
195061 taataaagct ttgcagcaac cttcctttgg gtctccgtgt catgggctcg tctttacgca 
195121 gaaagaaagt agacgactgg gaaggtatac tgcagagact agagaatagc tttgatcaaa 
195181 agattgatgc agtacttaga gtcggatata acagtttgca taaggatgac cagttcctgt 
195241 ttctcctcat tgcatgcttc ttcaactaca aagacgatga tcacgtgaaa gcaatgctcg 
195301 ttgatagtaa cttggatgtt agactcggct tgaaaaatct cgtgtataaa tctctcatac 
195361 agatatccgc agaaggaacc atagtaatgc acaagttatt acaacaagtg ggtagagaag 
195421 cggttcattt gcaagatcct aggaaacgcc aaatcctaat agattctcac cagatttgtg 
195481 atgtcctcga aaatgattct gtaagttgtt ttttttctcc tttcttcttt ttatatgatt 
195541 atgctaatat tttattttat ttggaaatat aatgatttta ggatggtaca agtgtgatgg 
195601 gtatatcctt tgatacatcc acaatcccaa acggagtgta tataagcgcg caagggttta 
195661 gacgaatgcg tgatcttcgg tttctcagca tctacgagac aagacgtgat cctaatgtta 
195721 gagtgcattt acctgaggac atgagttttc cacctcttct aaggttatta cactgggagg 
195781 tatatccagg aaagtgtctt cctcatactc ttaggcccga acatcttgtg gaactctgtt 
195841 ttgttaacag catgctcgag cagctttggc aaggagtcca ggtttgttac ttatatttga 
195901 acacactggt ttaaagttta gaaatctata taactaaatg tgaagtaatt ttgaggaaaa 
195961 aaagtacttt tgatattttt ttaaaaacat aaacaataat ataaaaaatg agtatcatgt 
196021 tgtttgaaaa aatggatagc aatatattaa aaaaaaataa taatttactg ttaattaaga 
196081 aaaattagaa gtctattata taatcataaa acaagattta aactttatat acatatttaa 
196141 aataaaaata acttaaaaat gatttatatc gaaaattgat tcaaaaattt acatatattt 
196201 aaaatttgat ttttactaaa atattttcca ctaaccttat aaaatatttg gaatgtatat 
196261 aaaatgtaca atacaaagcc aaattttaaa taccaagtta aattatggtt tttataattc 
196321 acattaagtt tgaaaatata attcacattt aaaaaatatt ttttctaaaa taatcatcaa 
196381 ataaaattta tttatataca tatttcaaat aaaaataata atttataaaa ttgatttata 
196441_tcaaaaattg attcaaaaat atacatatgc tcaaaaattg atttttacta aaatattttt 
196501 caataaccac tataaaaaaa aatattttct atatatatat atatatatat aaatacaata 
196561 aaactctaat ttcaaacacc aacttaaatt atggtatttc tatttctttt aaaaatataa 
196621 tatatgtgat agtttatata ataatacgta taaaaactat tatacaatga cataatacaa 
196681 aatagtgaca tagaaaacaa atagtaattt ttctttgaaa aacaacatcc gcgcgggtgc 
196741 gtaaacaaag tctagttatc attatatatt tgataaaatt tacaagtttt atattttatc 
196801 taagctttgg caaggagtgc aggtttgtta ttacgtttga acaccgtctg gtttttaaat 
196861 ttagaaattt tatcattgta tattgattgt tttacattca gcctcttaca aatctcaaga 
.196921 agatggactt gtccgggtca ctgagtttga aggaagtccc tgatctttca aatgctacaa 
196981 gtctcaagag attaaatcta acaggttgct ggagtttggt agagattcct tcctctattg 
197041 gagaccttca taaactggag gagctggaga tgaatttatg tgtaagtgta caggtttttc 
197101 cgactctact caacttggca tctcttgaat cactcaggat ggtaggatgc tggcaactaa 
197161 gcaaaattcc tgatcttcca accaacatca aatcacttgt agtcggggaa acaatgctac 
197221 aagagtttcc tgaatcagtc aggctttggt ctcaccttca tagtctcaac atatatggca 
197281 gtgtcctcac agtgccactt ttggaaacca cgtcgcaaga attctccctc gctgctgcta 
197341 cgattgagag gattccagat tggatcaaag attttaatgg gctaaggttt ctttacatag 
197401 ctggctgcac gaaacttgga tcactgccag agctccctcc ctcgcttaga aaactaatag 
197461 tagacaactg tgagtcacta gagacggtct gttttccttg cgacactcca acaactgatt 
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197521 atctctactt ccccaactgc ttcatgttgt gccaagaagc aaagagagta atcacacagc 
197581 aatcattgag agcttacttc ccaggaaaag agatgcctgc tgcagagttc gatgatcatc 
197641 gatcatttgg aagttccttg accatcatcc gtccggctat ctgcaagttt aggatttgcc 
197701 tggtgctttc tcctgcaccg gatatggaag aagcttattt caaattactg tttcgcatac 
197761 gtgcaaaagg ttgtcccagt gatgatgaca tgctttcgtt agatctcgct aaaatccaag 
197821 gggagcatct ttttatattt cacattgagt ttgttgaaca tcacgaggag atggtgttca 
197881 aattcagcac ctcatcccac gaagtcgacg ttattgaatg cggtgtacag gtcttgacag 
197941 atgaaaccag cagaagaagc aatgaatctt gttcagaaca agtgtctgaa gacggggatg 
198001 atattctatc ggatgatgat aagagcaatg agatttatga acccagagta aagatattta 
198061 cgggttatac aatgtttctt tctttagttt ttacgttcct tttgagtttg atttcaagtc 
198121 taattctgta tagattccta aagaactaaa acttgagtgt atgatattgt ttgtgttgga 
198181 ctttctatct atctacccat tcgatagcgg tcaccagtag taatttcacc cgtttaaaat 
198241 tttgttcatc atggattcat ggatacagtt ggaggctaga tagatggtga ctttaccatt 
198301 gttgggagaa tgctcttgtt gtctcgagtt tcttattgat tgaaaatcta attccccttg 
198361 gtgatatgta atctggaaaa ctatgttaga tcgtaagttg ataatgaaag gatccatcaa 
198421 attatccaac ttgaaaaata acacatactg tgtctacttt gcatggactt cctcattttg 
198481 gaaactcgta ggtgcaaata gtaaaataag taaataacta tgggtgaatg ttcattcttt 
198541 caacttgata taaaagaaaa acatcatttc gttagagaaa aagtacataa cataaaaaga 
198601 aaaacatgtg acgttcagga taagtttatt attgtaaagt gcatgcgttg aatgtaaaaa 
198661 gctcataaca tgataattcg gagctacgag tagaaatgtc tttaaaaagg atttgattct 
198721 tgatgtggtc tctcaagcag ccaagctctt agaggcaatc tcaaatacat tttcagataa 
198781 cccattagct gacattatca tctccaactg tgcctgtcaa accaccacaa aacacaaaca 
198841 caacagcttc attcatccag acatcttctc ttaggattat aaaaccaaca aaaatgctaa 
198901 tgattttgtt gtgtgtgtga gagaaaaaga gagacttagt taccttggcc agagcttgtc 
198961 gggtttcatc gtagcgcttc cacctcgaaa acgcagatac catacgagaa gcaacctaga 
199021 ccacacagga aagaaaaaaa aagaaacaag ttgtaaaggt tgtttatttt tgtcaagaaa 
199081 gttaatacgg gagtcaaacc tgaggattga ttttgtctaa ctggacaaca atgtcaccca 
199141 agaacttgta acctgatcca tcttttgcat ggaaattcac tggcgaaccg cagaaccctc 
199201 caatgagcga gtaaacctga atggtcgtcg cattgtttca tcgttagcat acatctgaga 

1992 61 aagaaagttg cggaacgaat aagagttttg agaattcact tgccttgttt ggattgcgca 
199321 gatcaaaagc tgggtgatcc aaaagcttct tgacattctc cacattcccg ggaatgtcgg 

1993 81 atgatgcttg aaggaggaac catttattaa caacctgaga tcagaagcca gataaagctc 
199441 agtatggcat attttaaaca tggaggcaga gggaagtaac gtgtgacatt gaagatctat 
199501 accaagtaat cgccctgcca cttgttatag aaatcagcaa gaacctcatc acgggtttga 
199561 cccggctttt gtgcaagagc tgccaaagca gcaatttggt ctgtcaaatt ggtggccgac 
199621 ttgtattcac ccaatgcaag ttccacgtat gctggatcct caagcgatgc aagataagct 
199681 ggcattattt caacaaagca aacacacaaa gaagttagtg acttccgcac aaagagagac 
199741 tcgtttaact taataatgac tcgcaaatgc caacctagag cagtgttctt caaagcacgc 
199801 ctagccatat tggggtggtc aaagacataa gcctcagtgc tcctattgtt ctcaacctgc 
199861 acaaaaatat aacaacagtt tgaaaagttt cagtgaataa acagcgagtt caaacgaaat 
199921 aaaaaaacag gaacttccta acttcagccg acactgactt gggtctcaaa agatttacta 
199981 tgtagaaaga agaagacata ctatctttag aagctcagtt ttaagttcag atgcaagctg 
200041 ctttctaaca aactttctaa cggcatgaac agcatcagga tccgccacgg ccatcatgtc 

_200101 cataatctct ccctccccag gtaatgttat tgccttggct ataaattcct acaccaatag 
200161 attaagaaaa agataaaaga ttaagaagtc gtatcctatg ctacgcttcc ttgtttgact 
200221 acgtggtcca taagggaatg tatagtcgta ccttgtccaa gctcgagtca gaaagcacgc 
200281 tgccgagacc ctgaatgaac tttgggttta gaaccaatgg tttgttttgt tggaaatcag 
200341 aaactaagtt cagcatgagc ttccttgcca gaacttgacc tgcctcccac ctaaaaatac 
200401 agaataaatg aatgaacgaa ctgcgcgtaa atgaaaaaag atctaaaatg ttgtggaaag 
200461 gcacataccg attgaattca tctgaatcat gggctaagag gaagaacaga tcatcatcag 
200521 agagatcagt ctcaacacga actggggcac tgaatcctct aaacagagac ggcacaggtt 
200581 tttctgatat atcagagaac acaaactctt cttctttctg tacgtaaaca aaacataagg 
200641 tccggaaaga aattgaaaac actggacgtg gaatagagtg gagggcgatg taacgtagac 
200701 tacatacatt acaaaaggtc acgtaacgta ttgctttaag aaaggaattc tgaactaacc 
200761 ttggtcactc gaaggattgt gctgctgctt gaaatggtct gaagtgtacc attatggtaa 
200821 acagaggaaa gtgtaatgtc ttttccactt gagtccagaa gaccaacaac cactggaata 
200881 aatgttggtt cttttgttgg ctggcccgga gtcgggggta tctcctgact ggaaaagtta 
200941 tcaattacaa agcaagccaa tgtgtcaaca gcagattcaa taaaggttca attgtaacgg 
201001 cagggtatca agtttggtgt agaaggtcat ttggcaaaac agactagaaa tgttaagaac 
201061 agaacctgaa tttcaaagag aaggtacgag cttcagcatt gtaagaggat gccactttga 
201121 cgactggcgt tccagcttga gagtacctaa aatatcaata ttgtatatcc ttattattag 
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201181 gatgaaaaca 
201241 aattagcaaa 
201301 cagcttgctc 
201361 gaggtgagcg 
201421 attggaaagc 
201481 cctcagctcc 
201541 cagcaacatt 
201601 tcattctgtt 
201661 taatcttaga 
201721 gattggaaca 
201781 cagatgaaag 
201841 aagttgtcca 
201901 cacaacatag 
201961 gcgaacagga 
202021 agaaaaaacg 
202081 gaat-tgatag 
202141 atcagatgaa 
202201 acaacacgaa 
202261 gaccttcctt 
202321 agtttcagtt 
202381 aatcatttgt 
202441 tccaattgtg 
202501 aaaaacatgc 
202561 accaataact 
202621 gacaagcttg 
202681 aatatcagca 
202741 tttccatggc 
202801 agaaaagtaa 
202861 tcatacatgt 
202921 ccaaacacct 
202981 tacatgcaat 
203041 ccttcagaga 
203101 tcagagatac 
203161 cagccaccag 
203221 gccgacctcc 
203281 catttattcc 
203341 tttccgttgg 
203401 ttcgccataa 
203461. caattgtact 
203521 gtgattttcc 
203581 agcccctaac 
203641 caggaaaaga 
203701 ttgtgggggt 
203761 ggcagagtga 
203821 gaccaagtaa 
203881 aatataaaaa 
203941 cctttagaag 
204001 ccaagggagc 
204061 atggaagaaa 
204121 tgtactagac 
204181 tcttctcttc 
204241 gttcagatta 
204301 agcatacagt 
204361 gtgcatccat 
204421 tgctgtctcg 
204481 taaaaatcca 
204541 cagaatatga 
204601 tgaagaccaa 
204661 aaacctggga 
204721 gaagtgaacc 
204781 tacaaaagcg 



ataatcttca 
atctgcattg 
atcatgt ctt 
taaggaaaac 
aacctttcgg 
ctaagaaaac 
atcatcgaga 
agaatcaata 
ttccctaccg 
gttgcaaaaa 
cctataacag 
tctgcacaga 
ttt caggaaa 
tgtgccatag 
agataaaccc 
atcctgagct 
aactcctgta 
tct teat att 
tagactgagt 
ataaagacaa 
aaattttaat 
gaagtactgc 
acaacataaa 
cccaaaattg 
gaattaaaaa 
caagtaegtt 
tcccctgaaa 
aaagacaggg 
taaaatctgg 
agaggaaccg 
actaaaaaaa 
atacatggca 
ctccctacca 
agcaaataga 
ctacaaatta 
agaaaagggt 
acaacagtac 
tatcaggacg 
ttct caacaa 
ggaaaccctc 
agaataaact 
aaaaagttga 
atatctcagt 
gatgacgaga 
ctaaaccaaa 
tgaaaattag 
cttcccctca 
agaagatcct 
caaagatgaa 
tgttgtacct 
acctagagag 
aaacaatggt 
ttcaaagtag 
tttggattct 
cttatatctc 
gcattggggt 
tagaaactga 
tctcaacttt 
agtaagaaag 
gcttcttaag 
aaattaaggg 



caagacattc 
tttgeatcac 
tgaaaataaa 
gaaataagac 
aaaccctcac 
attatatagt 
caaacctttt 
cataagggac 
tcactgatag 
acacaagcac 
aatcccagac 
aagttaaaag 
agatatgtaa 
gaccagcatc 
atgtagtgac 
ttgaaacatc 
aggaattgga 
atgaaggata 
tggaaccagt 
gcatataagc 
aactaataag 
acagcaaaaa 
atttcaataa 
cagcataatc 
tetgeaaggt 
attgttataa 
atggaaaata 
aagcatgagc 
aaeggegaca 
gttaagacac 
cttttcaaag 
tgagcagtct 
gagcgtgtag 
tagcatggct 
acgaagagga 
cactatt ttg 
aggatagagt 
gtcctaatca 
gagtcatctt 
tgcttcacat 
taaaaaggtt 
aggaacagat 
atcaatttcc 
atccaactgg 
ctgatttaca 
tactacatag 
accttgacag 
gggcaacaga 
tgttctgctt 
ttaactcgag 
aagcttagat 
gattctacac 
taatcaggct 
tcaactgatt 
tgacatatac 
tatatttatc 
ttcttgetea 
aagtaacagt 
ggtctgtatt 
ggaccctaca 
aaatgaatac 



acggttgaaa 

gcatagcagc 

gatcaatacc 

ctagatactg 

ttcctagtag 

cagctaaggt 

cataaacctg 

tgataggaaa 

agcagggtga 

gttagattgg 

ttcttagcaa 

agcaatgttt 

caagactctg 

ctggataatt 

ctaaaggagc- 

ageaataege 

gaaaaataat 

atttatgtac 

caeggcatgt 

aaaactggaa 

gtaagataaa 

tgaaagacac 

acagcataat 

tgcatctgt t 

agagaagctc 

agtaaagtag 

aatcaattag 

tctttatttt 

atgttgaaga 

aaaaaaacat 

tacttacatc 

ttggtagatc 

taaatgtatc 

tcttgaaagg 

tatgegttea 

ctaacctcta 

gacttgtcag 

ttgeatatta 

atatagaagt 

tgtgtgcaaa 

aaaactgttc 

gcagaaaagt 

agaacaaagg 

taatcccctt 

tatattctac 

aataaaagag 

agagtagctt 

atacttgttc 

gagtaaagtg 

gggaaacttt 

ccacctaaac 

ataggctcaa 

tggtgtagtc 

cggtggcaac 

aacaaaaaag 

atagaggaag 

aacaagtaac 

tcagttggtt 

gagtaagtct 

aacaagagaa 

aaaagcaaag 



tatgacttac 
aaagaagtct 
ctgtaacata 
gatacacaag 
agttttgtac 
tttttgetta 
aaacgaaaac 
tgaaactaaa 
tagecaggea 
aacccaatac 
acagaaact t 
atcaacacga 
accttgatgt 
aaacaacaga 
acaagactag 
tttacagtac 
cattagctag 
ctgatcaegg 
caccctatca 
gatccaacca 
tgaggctgac 
ggcataagat 
gtacgattac 
gcagtttctg 
atatatttcc 
aattacgttc 
tttgattatt 
ggatagaat t 
gatccaggtc 
cagattcgat 
ttcatcccac 
ttctgcagga 
atctctgctc 
atcctcccat 
gtaatgacaa 
catctccttg 
cttcaacacg 
aaaacactat 
ggcaaccaac 
aattcccaga 
tggtagatca 
ttacatcaag 
actcctcggc 
cctaatcaaa 
ectaeggagt 
tatgtatget 
caagtcatgc 
agaatgtgat 
ttaatgtcaa 
gatcttggag 
aaacaaacaa 
actgaatttt 
cttgagaaag 
agaacaaatc 
atggatttaa 
gaaagggaga 
ctgagacaga 
gagaaaacga 
etttgetgaa 
aactgaaaga 
ctaaccctaa 



cattgeaaga 
tcacaggt ca 
tgcacaaaca 
aaaaaagagt 
atcctcacaa 
ctgcatataa 
aaaaaccaaa 
tagagaaaaa 
cgtaatccaa 
aaatagtaac 
gcctgtgtag 
aegtaacatg 
atgaatgtgg 
ggattggctt 
gtacctgegg 
ggcttcccatf 
agaaaaattt 
aagacagtta 
tcacaaat ag 
acaagaat ca 
ctgt tgcctg 
t tcaatagaa 
atacttcatg 
gagatgecag 
atcaagacaa 
aaactcttgt 
ctcgacagta 
cactatttgt 
atactcaagg 
acctaacaaa 
ttcatggccg 
gtccagattt 
gctagctgtc 
aaggcaaagt 
agtgagagga 
agaaatgagg 
gcatgtgtac 
gtaagaacat 
ctggtaaaat 
tgacttgtag 
ttaaagatag 
tgaagtattc 
cggcagtgaa 
gtgaaagaga 
tgat tt tact 
taaattgata 
ccattcaaga 
gtgaaaaaaa 
aaactagt aa 
ctaacaattg 
acaaaaagag 
aaagaaggga 
atttccttag 
agcctcctgc 
ttttgettea 
gttacatcga 
gaaacaaaac 
aagaaacaca 
ttacgtaacg 
taaattaaac 
acaacatatg 
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204841 catattatac gagtgatcaa aaaagtcaaa gctagtggcc atatacaaac tcaatcattt 
204901 cgatttagaa tctt taacca aaggaaccaa aaaacattaa tccaaaacgc aattgtaaaa 
204961 ctttaacaaa cattaatcga tttggacctt taaccgaagg aacaacaaaa gaaatcaaaa 
205021 caagagagac agagagagag tttagagagt aatcacagga ggagcacgag agatcaaacc 
205081 gagaaggttg aacctagtgg ccaaagacga cgagcttcga caaggaagta tcaatcgagc 
205141 cataacaccc aaaaatcaag ctattaaaac cctagaaaac actctccact acagagaatc 
205201 aatcaaaaca cacagtaaga gaaaaccaga gcgacaaggc tagggctgcg gactttataa 
205261 gggatgtgtt aatcggatag catttaaatg aaaaagaaaa ttatggccac gaaaattcaa 
205321 tatrtaatatt aacaaacaaa caaacaaaaa attgatcttg cgattcgagt gatgatcaaa 
205381 atcgatagga ggaagaagac gagattaata cccgagatct gcgattatca gatgaaggag 
205441 gcacaactca tataatcaca agaagaaaaa tgctttgctt tgcgtgctct gctgatatat 
205501 ttatataacc aaaaaaaaaa gatatttttc gctaacaaaa gaaaaaggat aaaaaggaat 
205561 aacagaaatt atttgtattt tattattgtg tcttatccaa tttcctatcg ctgcacttta 
205621 gcacctcgag gttactctat tccaaaaaaa atcagtgttt ctgaaaaaaa aaaatatttc 
205681 caaaacatat aactgtgact aataaaattt_ aattggttaa caaatataga aaataactaa 
20574-1 tgacaaaaaa ataatgtata agactattag taatcaaaat tttatatttt ttttggatat 
205801 agtaagaact cgaaaacata cattttttga aatagagagt aatttatttt tggtcaacaa 
205861 ctcaactgaa gaagagaggt gtattttact atttcaaccg gataaagaag aggttacaac 
205921 aaatcgatgc taattctagt taaccaataa ccaaaatcgc aattttctct aacatatcat 
205981 ggtaaacaga gtttgacaaa gacttgttga agtatagttg gcttcataag cggtgcattt 
206041 aagaacaact tatttgaaaa attaaaattt catcaataac attattgatc tgtttcagca 
206101 tattatatcg gcaacaatat gcaattgcct caaccacata gaaatggcct agaaccatag 
206161 aaaaaaaaat ctcaacaaat agcttttttc ttgtataaaa acaacccctt atagcctaat 
206221 tagctaatgc cttcacagct ttctctgctg cgtcatccaa atcatcagct gttatgagtt 
206281 tcattccact ttccttgagg attctctttc cttgttcaac atttgttcct tccagacgaa 
206341 ccaccactgg tactttcagt gacacctatc atgtggggtt tggtttcgca caaaactcta 
206401 tcaattaact atttatgttt atatatgatg aaatatgtat gcttcgtttt gctcacctct 
206461 ttagcagcat tcacaattcc actagcaatc acatcacatt tcattatccc accgaatatg 
206521 ttcaccaata ttgctttcac tttatcgtcc gatgtcagta tcttaaacgc ctccaccacc 
206581 tattattatc acagcctttt ggttaaactg gtttaaacct aacaagcttg atagccagag 
206641 atactacgaa gaacatgttt tccttacctg gtgttcagaa gcgtttccac caacgtcaag 
206701 gaaatttgcg ggagtcccac cgtgcagttt aatgatgtcc atggttgcat ggccaatcca 
206761 gcaccattca ccatgcagcc aatctctcca tctaaaccga tatagttcag gtccactttt 
206821 gcagcagcca cctacgagat gaaacgacac actttccttc atcaagcttg ataataaaaa 
206881 ctaaaagtgt aacataattt tttattgtaa cctttctgat ttatttaacc ttaactggta 
206941 tatacacatt atcttgtgta caggaagagt attactaact ggcaaactaa tcaacaagct 
207001 gactaacaac gagaaaacca ataacaacaa aaactattta ccctatccag attcatgata 
207061 ccattttcta gagttgaaca tccataagct ttgatataac ataggtttct tggtgaagaa 
207121 catacgatac gatgcaatgg accatacctc tcgtggatcc tcctgcgttg gatcacgaag 
207181 ggcaaaaatc tctttctgac ggaaagcagc attatcatca aagttcaact tagcatcagc 
207241 agctaccaat tggttcgtgg atgtctcagc gagagggttg atctaacaga gaaaaagtaa 
207301 gggttcaaca aaagatattg aacttgtgta taataaacaa gacacagaga gaatgctcac 
207361 ttccaacata gtgcagtcac tcttgcggaa gagttcgtaa agcttcttca cttgttcaat 
207421 cgaatctttt ctgtcagcag cttttggagc cagaccatcc acaaccttgg cagcatcctc 
207481 atctgtaata cctgcaaata catcgatcgg taectacaga_ aaaccatgga atgagggtca 
207541 aaattggctc aaccagagta ggaatacttg tggtgatgaa acacaaatca aaccttaata 
207601 atcatgtcag ggaacttctc agcaagatct tcaatgctgg taccaccctt tttacaggca 
207661 attataagct gaaagaaggt attaaagaga tgaacgacaa gatcattcct gagcagtaag 
207721 agaaatggat tttgcatatt tattattttc aggcagcagc ataccggtcc agcagattta ' 
207781 cggtcgagaa taatggaaaa gtacatctca ttgacgagtg acaacttctc acacaagtag 
207841 acctgctcaa aaaaaaataa aaaaaaaata taggaaacaa gaccgctgag caaaacttac 
207901 aaaacgctaa aaatcagtat gatcttcatc aaacacacaa accttgctga ctactttgcc 
207961 ttgaggacca gtttgtttgg tgacgagaac ttgcccaagc atcttgccta caaaaagata 
208021 ataataataa caataacaac aacctgaatc ggatcaagaa actctgatga gagaaaaaag 
208 081 aaagtgcgtg tgcaaaacaa gtaccagcaa tatcttgaac ctgatcacgt ttgacaatgt 
208141 gaacaccacc ttgaagacca ctcttgaaag tcccgagacc tcttccacca gccaagatct 
208201 ggctcttaac gaccaactgt atattaacaa cacgaccctc ccatgagaca aaacaaacta 
208261 tttgttggca ataatgaaag ctcttataca gtacctcggt ttcattaggg aaaacttgtt 
208321 ggatagcgtt tttgacttca tcgagggaag agacagcgac tccgttgggc acgtttactc 
208381 cgtatttacc catcagctct gctccctgtc agagagataa ttgatattca aaaataaaag 
208441 aattacagat tcgatgaaga cagccttgtc gttcgtcaga tcgtgtccta atttcgcatt 



146 



WO 03/006622 



PCT/US02/22217 



2 08 501 cggaaacagt tgggaaagcg aatctaggta gagagaaaaa aaggaaggag aagaacctga 
208561 tactcgtgga tgttgagacg gcgaagctgt tggtgctgcc atttgccgga gacggagaga 
208621 gatctggaca cgagcttctt caccaatccc ctcatccttt tgatttttgt cttacggtaa 
208681 aaaaaaatga aaatcgatca gagaagtata gagagagaga gaggaagaga atagtcgcag 
208741 cgagtctagg gtttttaatt ttctcaaatg atcgatcttt ctcccttaag aaatgagagt 
208801 atttcgtttg tttttctaag aaaaaaaaaa cattccataa aaatacaata caaatataac 
208861 ttctgtcttt tcttttccac aaaaatcaca aataaataag tactattggc caattggccc 
208921 aatcccggac ttttctactt atttttgact aattttattc acaaaaggta cataaaatta 
208981 actgaagaaa ctagggatag atgttcggtt tcggactttc aaatattttg atatatgaat 
209041 atagagcttc tttgagtatc tttaaacttt ggatcggata ctgatattac agaccggatt 
209101 tgattattta aagttattta gaaaataaga gcaagataaa catagttctg ttatgtacca 
209161 gattgtgatc gacataaccg gaccgaccag gactgtaccc accataggag ataatgctta 
209221 tcgactgttg tacttatcca cttaggataa acacgacctg atgtatctat atatgtaaga 
209281 ctcttggttg agattaataa caagaaacac^gtttccccac ttagttttat aacacgttat 
209341-cagcacgaga ctctgaaatc _ccgagctac~c tcaaaaaccc taattgacgg caccacttca 
209401 aacagttctt tctctccaac cgtaaggatc caaacgatga tccacatatc agattgaagc 
209461 tcttgacgag acgaatccaa cgccgtcggc cacgcctcta tctgacttcg gacgcgccct 
209521 cacgctctat tataaaacgc gacgtcaagt gatctaaaac cctgaaattc taaaactcat 
209581 ccgacgactt caaacagttt gttctctcaa accgtaagga tccagacgac gcgtaatata 
209641 tcaaatcgca gctctcgacg aaacgaatcc aacgccgtaa acctcgtgtc aatccgatct 
209701 cggacgctcc ctcacactct gtttcaatac gcgccgccag taaccctaaa aactttaatt 
209761 ttggcgcaac ttcagaaaac gagtaatata tcaaattgaa gctcttgacg agacgaacag 
209821 atcaccataa accttgtgtc gatccgatct tagacgcgct ttcacgctct atttcaataa 
209881 gcgccgccag aaaccctaaa aaccctaaaa ctctaatctc ctcttttgtt ttacgatttg 
209941 tgtttgatca ttattttaac ttgtttttga ctcttatgtt taaggtctaa ccaagatcaa 
210001 acccttggtt catagattcc attcaaccca attggaggtt aagaagaggc gaccaatttt 
210061 ttatccgttc cggtcatgca gttcgcgatc cggctcgttc ctgctctcag tggtccggtt 
210121 ctacaaacat caaagtttag gtaattaaaa ttctgaaatc taaaagagaa cccatactga 
210181 atttggttgt attgataaag gtttaagact aataaaattt gcaaaacccc aaaccctaag 
210241 ataatagatc ccaaaatcct ttgagtaagt tggagctttc aagtccaaca gagattgttt 
210301 ctaacaatta aactcaaaac cctaatggtt ttgaatctca cagcttgatc tattgatcta 
210361 taaagtaact aaaaccctaa atatgaccta ttatgtatta tggtctaaaa attaattaaa 
210421 attcatgata atctcctaaa aatccccaaa gttagttttc catcatcaga atccgaccca 
210481 ttggtctgga actatgtgaa aaacgatttt tcggtttctg gtcagaaatt gaccccttca 
210541 aataaattcc gaaaaattaa ttaaaattca ccaaaaacca tgataaatcc taactaagaa 
210601 accatataag tagtttcttc agattcgttt ttaatcacat agtcatagtg taaatttttc 
210661 ataccgaaaa agtgcataaa aagtgtgtga ttgtttgaat tatttgaata acctagaaac 
210721 gattgttagg atctcattct gattttggtt gaatcattgg agattgcttg aaaaatttcg 
210781 gatttttcca gattgcaaag ttgcaaattt tacttttttc aaaactttcc ggttttgtga 
210841 aaattaactt tttatggttt ctgaaaactt ccatattagt tttagaataa tggaagattg 
210901 gtaaaactag tttaaaacca taatagaacg ttttccagat tgatagaatg caaaattaac 
210961 ttttctaaaa acttctattt ttgagaaatt gtcttttata agttgctgga aacttttcat 
211021 ttttcttttg gaagattaga aaaatgctat aattgttaag attgatctga ttttttttta 
211081 agtcatataa tggtttcgaa aataattatt aaaaaatgaa accatatatt ttcgaaaccc 
211141 taaacccttt tttctctctc tttagaatcc gataatattt " taaatcatct agaatcaatt 
211201 atttgatctg atttatattt ttttccatct tgatcatgta gacaatgatt gctaaggttg 
211261 catcataaaa catttttttt aattgattga tcatatgaga aaatcagatt gctagattga 
211321 aagaaaaatt ggattgcatt gcgtaaattc agaggttgaa agaaactaaa atctgcattg 
211381 ctatattgac taaaaaaaaa ttggattgaa ttataaataa atggttgaaa gaaaccaaat 
211441 cacattgcat aaataaaagg ttgaaagaaa ccaaaatgca ttgtttgaat ccgattataa 
211501 gtcaaataat aagactctaa aatctatatt ttcagatgtc gaatttggat tatcaagccc 
211561 ttaatctctc tggagataat tatttaaaat gggctatgaa cactgcaatt gtcctgaaga 
211621 taagaggact tgacagatgt atcatcaaag gcgagtatgc agctgaaaat gaaaaat.atg 
211681 . gggcaataac aattattcgc caacatctca ctgaggatct cagagatcag tatctaaata 
211741 ttgcgaaccc tctagacctt cggacagagt taaaatccag atacacaata gtgtcattac 
211801 caaaatttat aactcaaatg ggtcttttat ttgagtggat aaatctcaga tttcgggact 
211861 ttaggtccgt agatgaatat aactcagctc taatcaaaat cgtttctaaa ttgaaactat 
211921 gtggtgaaga ggtaacagag gaagatttac tggaaaagtc attcctcgca gctgatccaa 
211981 gggatctatt gttacaatat acctacagaa aaaatgtttc accacttata cgaatttgat 
212041 ctcgtatcta ttacaagctg agaagaataa tgagatacta aagaaaacca* gtgagatgag 
212101 actttctgaa gccaataagg ctggagagaa taagggtgaa tccaaagaag ccacgtccag 
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212161 aataataaaa ggagtggtcg gcttatacat tgcctaagaa tcgagatttc gacattctga 
212221 ttttatgttt tgatttgttt gtcatttata ttttttttat ataataagaa ttgttttagt 
212281 tttatattat catgtttgac ttgcttgata atttcttgaa tgataaatga caaatgattt 
212341 tttttttaag aaaaaagagt atagtaatta aaattatccg accaaaggca ttgcctaaag 
212401 ggggcatgaa ttattcaccc aaataaaaga cccatttgag ttataaattt ttgaaaatga 
212461 atggtttcca cattgaacca atgggcaaaa gaaatataaa ttccttaaga ttataaagaa 
212521 agtaaaaatc gcccaaggca taaaaatggt cgagactgta ctcccaactg atctaaacta 
212581 tgctaaaaga ttagtatgat aaaggcaaaa ggcctagata accagatagg ttgaccacga 
212641 aattttatac actttatggc atgactggac tagccatcca agtatttaaa attgatgcaa 
212701 agattgatat cgaaaaaagg cacaaaagag ttatcccata gaatctcacg ttgtgtaaca 
212761 tgtacacaag ggaaactcaa taggctataa tgttccaagc atctaaaata tttatagcat 
212821 gtacatgagg gggagaaatg acactcccgt ggtccatgaa caacactaaa accagtgaaa 
212881 aataggtcat gtgtgggaaa gaaaaagaaa tcgtgagaca tggactaagg tgttcatatt 
212941 catatttaca gcattcaaag aatggcttga ttatacaagg atactcacaa agaccaagga 
213001 acagattata aggagatgta ctcct'atgtg gtggatgcta ctacaaattc gaaaatgata 
"213061 aaggtctgga tataagaaaa gagaaatgta gtaagcagca tgattgatca ctggataaag 
213121 atgataagag tatcagaaag atgagaaaag aaattctcat agaatagttt tgttcctaag 
213181 ctattcatgg attgaaacaa ggcagttgca aatgatagac taagaaaata gttagtacaa 
213241 tcagtccata gatccttata gaggatatta taattccttg tgtttatgtt tataccaaac 
213301 caacctaaag agaggttcaa tggttcaatg atttcaatga tacaagttat cgattgattt 
213361 tggacagaat atgatgggag tagaaacctt agccatatat gagtatattg gcgtgtgatg 
213421 acaaggtcta tgacttaaca tgtatgtttt cgaaaacata gcattgtcta cgacaaaatg 
213481 aaagaagtca tgagacatca tccagagaaa gtgaccagaa gaatgtctga gtcaataata 
213541 aaatgaaagt ttccacggta tggcaaagcc atgagactag aggaaccgtg ggacatgaga 
213601 ggacctgcaa gaaagattta atcgcaggat agaggtacat ccaagtactg gtcgagtact 
213661 atccgagtac tgattgagca tcatccatcc gaccagaaca tgaagacatt gtcgagtggt 
213721 cagcatcaaa ggagcacgtc ttgaccatgt ccaaatgaag ttccagaaag ccggcgaaat 
213781 tacaaagtat tacaagaagc tcatcgacca gataggaatg catcgtccaa agatctacag 
213841 tgatgcattc atcaggggga gttcatgtgt tgtacttttt ttcctgtcca tggttttcca 
213901 ttttgccaca ttggttttag ggttttccag gagaggtttt aatgaggcaa cattaagcat 
213961 gcaacgaacc agtactggat gtgtcgatca aagaggagtg ttatgtacca gattgtgatc 
214021 gacataaccg gaccgaccag gaccgtaccg accgtaggag ataatgctta tcgactgttg 
214081 tacttatcca cttaggataa acacgacctg atgtatatat atatgtaaga ctcttggtcg 
214141 agattaataa caagaaacac gtttccccac ttagttttat aacaagttca taaatcaaaa 
214201 caaaaaaaaa agtgaatcat ataataattt gtaagttgta agatgaaacg atatagtatc 
214261 aaataaacaa aaacaaaata aatagttaat caaagagcag aacatcaagt aatcaaagat 
214321 aaaaagcaat agttcactga agtaatctta gaaaaagaat atccctccgt aaattgaaaa 
214381 gtatgattag atttaatgtg aactattgaa aagtgtgatt agatttaatg tgaactataa 
214441 agaattagtt taatttatca atatttataa attcaaacct ctaattcata tatacatgat 
214501 ccttgatcat tatcaagacc gtctaccaga cattatatgg taagactgtt tggtactgat 
214561 atcaagggaa aacacaacgg atcgcaagtg tttgggtgca aagtacttga tcttggcgtg 
214621 aatgctgaag agaaagatat gttatctcat tgttgagtct atcatgtcag taaaacaaac 
214681 agagtaaaaa gatcaaagtt gacaagtatt cttagcttgg caatgacgat acaaaaacaa 
214741 atctgtagtg acctcaatta acaacaaagg gttcattcac aagaacggtt tctagatggt 
214801 ccagtatatt cttggcttga caagaaccaa agtattggtc tgcatcttga gaaagtatcc 
214861 tacagcccaa gagctttctt catcaaacgt acgcagaagt ggaatcactc gattgagagc 
214921 gaacgaaact atgaacttct tgggtaagag cttcggcact gaggagacag agcgtgagaa 
214981 acattatgaa gtgccgtcac aaacaaatct accagaaact tgtaatgaaa cgaaattacg 
215041 actcaaaaga tacacagaga agaaaccgag aaacctctgt ttcagtctct tcaccagttt 
215101 actgtattgg ttgtattggt attttttttt tttgagggga gaaaacgaca gcatattgga 
215161 aacaagtcat tgtggggaga aaaacgatgg cgtattaagt gtgagatgtt gtttatttac 
215221 gtcccatatc gttaatagac agggaagtca agctatttat aaagtgcaga gtcgaagcta 
215281' gaggtcacga ccttacttga acaggatctg ttctataggc tcgtacctct gtatccttga 
215341 tttctaagga gacaggccct caaacctggt tgatgaatca aggccgtgcg atctgagtgt 
215401 gattaacggt tgtgatactc ctcggaggct ccgcactgta gaggatcgct atccggttag 
215461 gtcttgatct ctccagagtg gtcgcacggt tgtctgacag gctttctttt ttcgtttatg 
215521 agagtatggt tggcattgat gaggatgtta taactttcaa ggcataaacc tcaacagtcg 
215581 cctcacctac ttgagttata ttgcttttgt tagcagagtt tcactctgtc atattctttg 
215641 ggttctcgct gtaagaaaaa tgattaaagc tttcgttgat tctacatttc tcatattgtt 
215701 atttctggca tttttgccat tgtgatagca gacttttgaa ctttcttgct ggtccgggta 
215761 ctctctttac ctcaggggca gcctttgaaa taggcatatg atcatgtggg aagaaactca 
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215821 
215881 
215941 
216001 
216061 
216121 
216181 
216241 
216301 
216361 
216421 
216481 
216541 
216601 
216661 
216721 
216781 
216841 
216901 
216961 
217021 
217081 
217141 
217201 
217261 
217321 
217381 
217441 
217501 
217561 
217621 
217681 
217741 
217801 
217861 
217921 
217981 
218041 
218101 
218161 
218221 
218281 
218341 
218401 
218461 
218521 
218581 
218641 
218701 
218761 
218821 
218881 
218941 
219001 
219061 
219121 
219181 
219241 
219301 
219361 
219421 



gactttaggg 
aaaaat atgg 
aaaaatttgc 
tcaaaactat 
catgtt tatt 
actcattcag 
cttataattt 
gttatgatac 
aaaaaataat 
ctgtttggat 
atggtggtca 
actaaccaaa 
aacataatct 
tcagtatatc 
cccatcagtg 
acaaacagag 
gtatttgcca 
aataggtcat 
tcccat at eg 
aggt cacgac 
ttctaaggag 
attaaeggtt 
tcttgatctc 
ctttttagtg 
cggtaatgga 
acattact tg 
actacactct 
aagctgaaac 
ctcgacttat 
tgttatgatt 
atcactgaaa 
tgaactgtt t 
tggacacatg 
tagt tactac 
aaactcatca 
geaegteggt 
gaaattaaag 
atcatcaatc 
ctaagaaccc 
accaagaaac 
aacaaaccaa 
ccgcaactca 
tttat catac 
caagaatcca 
acctgaaaga 
ttggaactgg 
aaaatacagt 
ctgttctaac 
cagacatct t 
gctggagttg 
acttaccaca 
aaggecatea 
acgcgccatc 
gggtattgtg 
actcatctac 
aggcatgaat 
attagagctg 
caaaactaga 
tatatacctg 
ggcaattaaa 
agaaaacaag 



aacaaatatt 

gcaagtggac 

tgatctctga 

cccacaattg 

agaatgatgc 

acaagagtga 

tattttctta 

aaaacacaag 

taaaattata 

ggaaccttga 

caggatcatg 

ccttaaacca 

ctaatgtaca 

ttcccctgag 

attcacgcgc 

ttaaaagatc 

tttggtattg 

ttggggggac 

ttaatagaca 

cttacttgaa 

acaggccctc 

gcgatactcc 

tccgtagtgg 

cactgaggtt 

cagagcagtc 

aaceggatet 

tttaagtgtt 

tcattcagac 

aaatttgttt 

caaaacacaa 

actagttgag 

ggatagaagc 

atcatgttga 

caaaccataa 

ctttttttta 

gctaattcta 

gtgaaacaaa 

atcattattg 

attccatctc 

atcatttgat 

atcgaatcaa 

aaccgaggac 

acatgat tec 

cacaaacaaa 

aatataatca 

gaacatgtgt 

aaacaaactc 

agaatatacc 

ccccgttgct 

t taaaaaaag 

ttaagtaatt 

ggcctaccac 

tcaataatct 

ttgtataaaa 

acgttcaaat 

gcgatagtta 

aact taacga 

ttatataaga 

actaccacga 

ggttataaat 

ccacacaata 



ggaccgggaa 

atgtactgat 

gttaatttgt 

cttctttatc 

agatataagt 

tctagattgt 

gtagaatgat 

ttgggcagac 

caaatagact 

aaaaggecta 

tggaaaaaga 

taaaacaaca 

aagttcaaaa 

ccaccaggtt 

ggttt ccact 

aacattcacg 

gtattttttc 

caaacgatgg 

gagaagtcaa 

caggatctgt 

taacctggtt 

teggaggetc 

tegcaeggtt 

agaaaccgea 

gtgttgttta 

gttctataga 

tgtacatata 

aagagtgatc 

cataatagaa 

gttgggcaga 

attataaaaa 

ttgagaaagt 

aaaagactgg 

cgattcaata 

ttcatttgaa 

atcat ccaag 

aactttgtaa 

gecgataaag 

tgcttcctct 

atttaaggaa 

gacagaataa 

cgacttcttt 

tatttttata 

caaacgaaaa 

gatttattac 

tttaacatgc 

tacaaaactg 

tggcttgtaa 

ttcttgett t 

gaaagatttg 

gaaagtaagc 

ggttctcagt 

cagaaatgac 

gtcactacaa 

ctttgaacct 

cagecagata 

caaatatgag 

tgatgattag 

t tcattctct 

caatcaccaa 

attaatggta 



gaaaaaaggg 

tagcaatatt 

gaagttgaaa 

atgagtcata 

catttgggct 

tactttctgc 

gagcagaaga 

ccttttctga 

cagttttcta 

tcaaaactca 

ctggaaggat 

tattgatctt 

tgattccaaa 

tactcaaatg 

tcccatagtt 

tcttatttta 

ttttgagggg 

cgtattaagt 

gctat ctata 

tctataggct 

gatgaatcat 

cgcactgtag 

gtctgacagg 

atgtataaag 

tagaacgaag 

ctegttat tt 

aggcacttga 

tacagagata 

ttatgagaag 

cccttttctg 

cagtctcagt 

cctatcaaaa 

aggtattacc 

taccctaaga 

cgttgtatat 

aatcaaaatc 

agtgtttgtt 

cggcgttttt 

tgttccattg 

cctgattgat 

accgaaacaa 

taactctatc 

tatatatata 

catgeaatat 

ggaaagtaac 

tcccaactct 

gtttacattg 

agttgtcaaa 

ggatatcata 

aaaagacaaa 

agttggtccg 

aacaaatatc 

ctgcacaaag 

gtccagaata 

atgttacgcg 

acaaaatggt 

agaaaccaca 

atttagtaca 

tggcgttagg 

gccccgaaac 

aacatctatt 



atagaagatc 

attaccaaac 

catattataa 

t ctgtatttc 

ctgtgtcttc 

ctgccgtttt 

acttaggtga 

gatattgaga 

ccttcttaga 

caagtcacaa 

gggattggta 

ttttttttaa 

atgctgttgt 

gccggacccc 

gagtgtgtta 

gtctcttcat 

agaaaagggc 

gtgagatgtt 

aagtgcagag 

cgtacctctg 

ggccgtgcga 

aggategcta 

ccttttttta 

cagtgttgtt 

ccaagaaggt 

ttaagctttg 

gctctgtgtc 

tttctgcctg 

aacttatgtg 

agatattt ta 

ttgctaccgt 

actcactagc 

gtattggatt 

gaattgecac 

tgtattatcc 

gcaatttttc 

tagaagcegt 

tagcagagtt 

ttcctaattt 

atatattcat 

tgcactttta 

gattttgggc* 

tgcccagaat 

cttaggacaa 

tataegtett 

tttacccatc 

gagcaaagaa 

aatgacgtaa 

ccttgttacc 

aatcattagg 

aatgggagat 

acatcactat 

aaacaattca 

ctcaaactcc 

taactatctg 

ccaaaaggga 

aggagtgacc 

gaaaggcaga 

aaatacattc 

gttgtaaaaa 

caacgagagt 



cataaagt tg 

ct tgaagcat 

tgtaacatgc 

tgcctgcagc 

taaagct cag 

gattctcaaa 

actctgggat 

cacataactg 

ttgtaatgaa 

geagecttga 

ctgattagtt 

gttcatttga 

agttatgaaa 

gcagctcaag 

tgttagcaaa 

ctgetttact 

tgcgtattga 

gtttattacg 

tcgaaggtgg 

tatccttgat 

tttgagtgtg 

teeggttagg 

gctttgattt 

tttttct ccc 

cgaggtcacg 

gt tgatget t 

gtgtcttctg 

tegtttgact 

aactctggga 

agaaacacat 

agattgtaaa 

agctgtgaaa 

tgtactgatt 

atttactttt 

tttttagtta 

tcaaaacaac 

gcatgtgagt 

catgtagggt 

cacctgcatc 

tatacaattt 

cctggtagag 

ctccttctcc 

cattttctat 

aaaaacttgt 

gcatccagtt 

tgccaaagaa 

ggaagtgaat 

ggatattget 

taaaacaaag 

atgcagaggt 

gagaaacgat 

aatcatgega 

gtatataaat 

taatagactt 

atgtaagtaa 

gctaaatacc 

aattggaaaa 

agatgacaaa 

ttcaactcct 

gtaacacgga 

actgaaacta 
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219481 aattcaatat atatatatat atatatacct taacgaatcg cgttagagga gcacttggat 
219541 ccctagacgt agtcatcaaa atcttgggat ctttttccgt tgcttttgca tattcatcat 
219601 caatatgact ccctggaact actttcacaa catgattaga caaagattat ccacttttag 
219661 cgacaaacca caataatgac- actgacaaca agagtttcaa acgaatgagc gtatatacta 
219721 actacacaaa gtatgaacgt atactaatta cgcaaagaat gaacgtatac taattacgca 
219781 agttcgaaca gaatatattg aactatgtat tgaaacaaca ataacttgga acctcgagct 
219841 tattaacaat aagctccaat tatataacac aagtagaatt tagtacacaa caggcttcac 
219901 aaaaaaacat gaggattctt accggaggtg ttttgatctt caagatcaat ctcctcacga 
219961 agcttcgcct cctcgtttcg gagctcagat ggaatcggct tcccttctgt tcaaacataa 
220021 acccaaatca caacaaaaaa ttatagattg tttaagagag ttaaaagaaa aaaataaaaa 
220081 atcagtgtgg accttgcagg gcttccctta tcttccgctt cttctcatag agccgacgct 
220141 cttcaccttc caagtttttc atgtagagat actcctttct taacctaatg ttcctggcaa 
220201 acatcttgat tcgtcgtgag tgacagagag agagagagag agggtggggc taccaaggaa 
220261 ccagtttggt aggaaagggt ttcaaaagat gcaggagggg aaggcaacta caagattgcg 
220321 acagtcttat aat aataaaa-aacggcaagg tgtttgtcta tgaaaaggtg aaaccattag 
220381 caagctactt ttaggccact tagtatgtgg gctctaaata ggccacaaaa caacactacg- 
220441 aaatttttga cagtgacgca gggtttaact attgagttgg tgacccaatc tctcaaatta 
220501 gtgtacagat g'tgtccttgt gcttgtatat attttctatt gttccatcta gaccagattc 
220561 ttacacctaa ttaatagtta tcttagtcac tctgttgtat aactagtgtc ctttcgtcac 
220621 acaagtgtgt taggtatgtg ccggtgtttt tagacatgat tcagatttgc ggttaaaccg 
220681 gtaaatacgg tgatctacaa ctaattcagt ttatattcaa cgaaaatcct tcaattaaaa 
220741 cccaacaaaa tctaataaaa ccaagaaact agtattaact tgtgatccga cactggttaa 
220801 tccggtaaaa accacaaaat ttaaaactat tgcgagatca actttgatat tatcagtaat 
220861 aacaattttt atagtttatt cagatcgact ttgaatagta tctgatattc aaaatattgc 
220921 cctacatgat atattcatat atttttactg atttaatata tatatatata tatatatata 
220981 tatatatata tatatatata tatatatata tatatatata tgtttaagtt ttttggttga 
221041 tctcatttca attagtaatt gtcatgtcta cagacaactg gtttttgagt gtgtggtttg 
221101 gatgttagat aggagaattg gtgttagatc agtgaagata tgtcatacac atgatggact 
221161 aatatttatt tggactcatt tggaaagaaa cataagttat tgttttctag ataatattga 
221221 tatatataca acttggtagc atattattat tatgttcaga gctatttata aaatcagtga 
221281 gattcataat gcaatgtgaa aattagagtt gttgtacaaa attacagttt tgcaagatgt 
221341 tcaagttaga agatcgatta taaaatataa attttctgat tgaaaatctg ttacaaggta 
221401 aagttttgaa gaatgcatca ctaaattaac atcttaattt ctatttaatt tttgtagttt 
221461 gtgatgcata tttttatgta gtaattttta agtagttttg tcttgatttt gtaaactagt 
221521 tttggtatta gtgaaattgt tttttctttt gcttggagaa aaatcatcta ttttatataa 
221581 aattgaaata tgaaattggg catggttgtt tttcggttta gtttttagtt ttttttttgt 
221641 tcattttttt gttgttttag atatagaaac ctttcgtaaa tatgaaagac atatgcgata 
221701 acaataacag taactttttt tcttcttata aggctatgtt tattgcaggg tttttagggt 
221761 ggaattttta gactaatata aaaaatgtct cttaactttt aactaataaa aaataataac 
221821 cgtctcttat atagattctt agttttttta tttaaagtta aaagatctaa aaaacggttc 
221881 ttagtttttt cttatttaaa attaagagac agtttttata ttaagctaaa aacattataa 
221941 ggtctgccct tcaccatctg atttttacct tccagtattt actgcggcta actaaatcaa 
222001 atgtaactta cagatggctt ccagattcct ctcgagtttg gagagattac caatggtaag 
222061 tcactctcac ttgaataata ataattggat ataacgaagt gttctatagt ccacacatta 
222121 tatagcacaa agaatctgaa gcgaaaaaac tgttttaaga ggcaaacaaa gtggacactt 
.222181 gtaagttgtg agttgaatct tctccttgtc tcaccctttt gcgtgtgtgt tgtgtgtggc 
222241 cactaatgta gcaacaaaaa aaaaaacaga gcacgttggc agggaagagc ccagacacga 
222301 agagagggag agatcttatc catggcggct attactagtc tccaagcaat ccatctcaaa 
222361 ctcgggagac gtggcagcat ccgatgtggg atctcggagc cgagcggaga gccagctccg 
222421 gtagggcaga agactagata caacgatggc ttagctgaga gagtgttcat ggggctgttc 
222481 gcgaggaaga tggacaagtt tggtggctcg aagaagaaga agaaggatga gacgaaggag 
222541 aaagagtttt gggaatacga ctacgagagc ttcgtcgagg tttcaaaggg agtgatgcaa 
222601 ggacgatcaa gggcacagca gcaagaggtt gtgagagagg ttcttctctc catgctccct 
222 661 cccggcgctc ctgaacagtt tagaaagctc ttcccaccga cgaaatgggc tgcggagttc 
222721 aatgcagctc ttacggtgcc tttctttcac tggctggttg gtccttctca ggtcatagaa 
onon^ Stggaagtga atggtgtgaa acagagaagt ggggttcgta taaagaaatg caggtacgct 
222841 gtgactacta cttgttcttg tcatgtttca aaattgtcac agactttgct tttgctcata 
oooo2n ct 9 tt 9atat cgaaggtatc tagagaacag tgggtgtgta ggaatgtgtg tgaatatgtg 
ooo.o caa 9 atccca acacaagatt tcttcaccaa tgagtttggt ctcccactca ccatgaaccc 
llllJ; aa 9taattaa tctttttcct caccatctaa tcccttcttg gggttttgct tacagtactt 
223081 gcctatttac tgttgtagat tttgaagata tgagttgcga gatgatatac ggacaagtgc 
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ctcctacatt 
cttacaaagg 
ggttctatag 
gatcagctag 
cagaaaagtc 
ctggttgagt 
cctctctcaa 
gaccgaggaa 
ttgtaaaacc 
caaacagaaa 
cccaacaaac 
tactagcttt 
tgtcatagag 
cccgttcgat 
atgcttttgt 
tatatatt tg 
cataacaaat 
tatgaaaaat 
cagctctctt 
cgt tttatac 
tgcgggtaga 
aatataaata 
aactatttta 
attataatat 
ttatttaaca 
tttccaaacg 
atccacaaac 
tatatttttt 
ttgttattat 
aaacataaat 
tcaataataa 
caccgaaata 
agtttaattt 
ttgtactcta 
attttatttt 
catttttagg 
gctccaccca 
tatcttaaaa 
tgtaaagaaa 
aaatgacgtt 
aatgtaaaga 
taatactttc 
aaggaaataa 
tttagtagtc 
catttcgtaa 
ctgaactaac 
aatgattaat 
aaaaaagtct 
ataataattt 
atataaaact 
gacctctccg 
taattacagt 
cttcactttt 
attaacaggg 
tgtcattctt 
gttttttaaa 
tcaatgtaat 
tttcttaaaa 
ctaataatta 
tttaaaaata 
aaaaattcca 



tgaagaagat 

cctcgtatct 

ccaatccgag 

atgtgctatt 

acaatattgt 

tgtgaaagta 

tctttatcgt 

tcatataaaa 

ttttgttagt 

cagtgaatgt 

aatcttgatg 

tgacccgtgc 

ataattattt 

gtgtttagca 

ttatt ctaat 

ttggttttat 

ttatattgca 

aacatacaaa 

ggttggtagc 

tgttctgaac 

taaaatataa 

gtaaaatgta 

tataatttaa 

tataatttta 

gctactgcat 

ctatactaat 

atccgcaacc 

ttgcaaattt 

ttaaaattca 

tacaatacgt 

taagcccaaa 

ttaggctatt 

taatggcata 

gt tt taatag 

attttcttat 

ttcttacaca 

aaaagttata 

aacctatcat 

aaaaatgttt 

gaattattat 

agataatata 

gaattatttt 

aatatataat 

ttcttacatt 

ataatgtcta 

ctgttaagag 

tttaaaattt 

agaccattgc 

tgatggcttc 

cttgaataat 

gcgtaatctc 

ggtatgaatt 

aataggatag 

gtagtttggt 

ttgtatatga 

gatgtatgtt 

ttttgtcaac 

ttttcttggt 

aattttaata 

gagggagtag 

aaaaaggaaa 



ccagccacaa 

tagtatgtaa 

ctccccagtc 

tccaaagata 

agagatttga 

tccgataaat 

ttgtggagtc 

gagattgaac 

tactagagag 

gatctgggag 

agactaccag 

"gtccgcacgg 

atgttttagt 

tccatttata 

tggt aaagta 

gaaatagtag- 

t tttaaaaaa 

agacggacat 

ggttattggc 

ctctgaaaaa 

cagt taaaaa 

tatattacat 

aatattaata 

ataatttata 

ttaatagtca 

tgtacaaatt 

gtaaatacaa 

tattaagaaa 

gtacttaata 

aatttataaa 

actaatctgt 

aaacgattta 

tgattgtaat 

tttagactag 

aaattgatta 

tccaaaccaa 

atatccaaag 

tacgtgaatg 

ttcattcaca 

gataaaaaca 

ataatttaaa 

gtaaagagaa 

aagttgtaaa 

ttctttgtaa 

ggagatattt 

tcaaataagt 

agtataatat 

caccagaatt 

gtgttttgat 

gtataaaat a 

aatccagtta 

gtaattattg 

atgttaataa 

gaaaaaagtt 

agtgcaaatt 

ttaggaattt 

taattatgaa 

ttagaaatat 

tgttttatta 

tttgtaatat 

aataaaatct 



aacaaccttg 

tctaacacat 

tgt cctaaac 

tacatttaca 

tactggttag 

acagacactt 

acatgtttca 

atagtgcttt 

acaaattaag 

gtgatcat ta 

aaacattaat 

gtttttattt 

tatatattca 

taaactataa 

aaattaaagt 

tat taatattf 

aagtatataa 

gactggtttt 

gttttagaac 

caaaaattgg 

tagaaatatt 

tttaatttat 

gaataatttt 

aatatataaa 

acagttataa 

ttaaaaatgt 

ccggtgcact 

gactaattta 

aatatacaat 

acaaatgctt 

ccaaaacata 

gtcttgttta 

tatttaggaa 

agtttgaccc 

atgacctt tt 

at ccaccaga 

agagt ctaat 

aatacatgaa 

tttggaatta 

attaatgaag 

tgggtaagtt 

attaataaag 

gttaattcta 

gggggactgt 

ataaaaacca 
gttcatat tc 
tatattaatt 
tgatttaata 
aaagtttaat 
ttaaacctga 
ttatttaaat 
tggaaaaatt 
tattatgaca 
caagaattca 
ttcaatatac 
tttttgtttc 
taattgtgaa 
aggaaatata 
aaaagtgtga 
aatgtacaag 
tctattttta 



tttagcagac 

gaggatgttg 

tacaagcatg 

tactttttct 

tattattatg 

gacct tactc 

gcttattaga 

tgt tagttac 

aaatgtggaa 

aaagcacatg 

ctgtgattgt 

atgtttataa 

tattcttgaa 

aattaggttt 

gtaaatgtta" 

tggactaaag 

gttaaacaga 

tccactacca 

aatcataaaa 

ttctagatag 

caaaaaaaaa 

attgtaaaac 

ataaatataa 

acgtgaatat 

gtatcccgca 

ttaaaaccat 

tataccagtt 

aattaaacaa 

attgatcgtc 

tgtgattata 

ttatttatt c 

ggaaatacat 

aatgtagggt 

gtacgtccgt 

taaacatata 

accttaaaac 

ttcaaaaccc 

tatatatgcc 

ttatgattaa 

tagttaaaag 

ctattttgta 

taattaaaat 

tttgtagtca 

agaaaaaata 

ttttgtaatc 

ttcaatt tta 

gttcgagact 

tattaaactg 

tcaaatgatt 

acgtaatgtt 

taatcat taa 

agggttattt 

aaatataggt 

tttttcaaat 

tgtagtatga 

aaaaagatgt 

tctcaaaaac 

aaataacaaa 

aaatctcaaa 

cctaaaatat 

ttttaagaat 



atatgtacgt 

cttgcctgca 

aacagcattg 

caaacataca 

tgtttctatt 

atctgcatcc 

ggattcgttt 

tttttttttc 

caaatgagta 

gaagttttgg 

taataattag 

cttttaccaa 

gaaattttgt 

tttcattata 

aagttttaaa 

gttgaatatg 

aacgt taaat 

ccct caaacg 

atgctacaat 

cgtttgcggt 

aaattgaaac 

tttaaaagga 

ttatatattt 

tgttaattta 

aatacatcaa 

aattacctgc 

atgtcttcag 

ttggcccatt 

taacagctac 

acttgggcca 

caaacaaaat 

taaatgatag 

taattcctat 

gcgggtgttg 

agtttttctt 

tccgacttga 

aaaaactcga 

taaatgatta 

cacataaaat 

tgtgaaaaag 

attgtataat" 

aagtgaaaat 

tgtcatcaat 

tgtagcaaat 

taggtgtaaa 

aaaatcaatt 

aacatt tcag 

ggcagataca 

at tgtataag 

tttggtaagt 

ttgtttgttt 

cctatttgta 

tcataaatag 

ttgcaaatta 

tactccctct 

ttttatattt 

at taat tgca 

aaactatgca 

acatgtatta 

catatggccc 

tgttctataa 
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226801 tctcagcatt ccagaagtga tttttagaaa ataaatattt gtccaaaaaa ttatatatat 
226861 ttcatctata ctatactaaa aggcaaatac tcttaaagga gagagtatcc acgttggatg 
226921 attaaattag ccaattagag agaagttatt tatccatgtc agcattgctc tgtcgacaac 
226981 aaatttggtc attcatatgg gcttctaaat ttcgaaatgg tccatgttaa gccttttcct 
227041 tccaagcttg atcgtccatg acctttacac ttcgtcttca ttgtttaact ctcagaaatt 
227101 cgccgttacc acgatcactc aaatcgattc aatatgttct ttactacttt cttcatgatt 
227161 tcgtttcacc tcctttgttt ttcttcctat ataaagcctt ttacatcaac ctaaacctgc 
227221 agcctccaca gaagaagata gagctggatt cagatttcat tcgatgaaat agttcaatga 
227281 gaactgcaac aggagagccg atctataagg taaatctaat atatagtttg ttggtaatgt 
227341 attattctct ctccctgtag gtgttagatt caattatccg acttatagct gagacaaaca 
227401 ttaacaccgt ccaaatacgc agattagcta atcgatggac aaaatctaca ggagctccaa 
227461 ctactactca tacttgacag caacagaagt cgcttgcgta catctcctcc tttacgttgt 
227521 tccgatacaa attccctacg gctcgggaat ctcgaggttt tgtattatgt attatcttcc 
227581 tcgggtcgtt cacaggtaag gaaatagtct cactgtttgt aagaattttg atctgatcac 
227641 ttcctttcag attttgaatt actccgcttt ttgaggacga caga:gtttcg aattgaaafcg 
227701 tgatggctcg catgaacctc tctottaccg aacataaccg cgggagagat ccacgtcatc 
227761 atgaagtaga cttcgcaatc ttctctcaga aactctgtta ctctctgctg aaacttcccc 
227821 gacacgtctc tgcctccaaa aacgtcgaga acgtgaagat tgttatcctt cttgactgtg 
227881 aacgtatagt tactttgctc tctctctgac ctcagaggta agtaaggaac aactgtcttg 
227941 atctctatcg tcacaacctc tgtgaagtct ctaaccagta catgagagac gatgaagatc 
228 001 gtgaaaacga tgagagatat gactgaggct gcagatgctg tgaagagtga tgatcgtgag 
228061 aggtttaacc gccggtgatt gaacatcgcc gtgaatttct tggcgatgac gttatccatt 
228121 ttggttaaga attaaaggta ggaggagaac tcagttttgt gaagtctgtg ttttgttgtt 
228181 ttcttatagt ttgttttttg cttggggttg . aagctgtcgt gttctgaaaa gttcttaagc 
228241 cgccagagaa cgtagacttc atttttttat tattattatt aacgagtggt tctcaactaa 
228301 gcctatagtc tgattacaat gcaagggaga gaacaaaacg gtgttcatta tgtgaaaata 
228361 atgaatgtgt ctaacgattt caaaggccaa gaatcttggc aggttcctgt ttttctttca 
228421 aaatgtaaat caatcgtgag ggtgagacag ctggttactt gctgattttc gcttctaact 
228481 atgctgttga atatcataac tttagttagc aagactttcc gacatattta atatttagtt 
228541 taatcttgaa ccagtgtaaa atagttacct cttcttagat attgtctaat gtctttagga 
228601 ttgatgtttt cttactgcat tgatactttg taaaaggcga ctagaaagaa cagtgtggct 
228661 tatttctaaa tgttatgatc attatcatta ctgtctttta gatatatgtt tttgctttga 
228721 tattttcagc cactaatcca acgtgacgga ccaaagtgca aaagatgtgg aatctacgag 
228781 gaagacaact tttatatatg atccatttga cccttctcat tcaccgctga gagtaaatac 
228841 acacacgcaa ggacaacaaa tttattgcta cttttctctg ccgtggatgc tcacaagtag 
228901 gggctggtga gtttccaaga acagtatatc ttgagtttta cattcatttc tgttccttct 
228 961 agctttacag tttaattatg gtgtaggggt gaatgtgatc attaaagata tcacatgctt 
229021 attttttgta gctgaatgca tccatacatg gaattgtcaa agctctcttt gaatcttagt 
229081 gtgggttcca gaaggtaaat cagtttttac tagacagttt ttttttttga agtttttgtt 
229141 tccacattga agactctcct gcttatataa ggtggcagac caaactatag tgtctaagtt 
229201 atatctccct caaggttttt gtagtcttct acctaaaccc atgtggaatc atgcattcat 
229261 gcctatgtat ttgttttcct cttagtttat aaaattttac acattaatct tatagagaaa 
229321 aaattcctgg cactataagt tggtaagtgt tatagagaaa ttttagaatc tcaaatgtta 
229381 ctttagatta gaaatactgc gtcatgtgtt agcatttgtt taaattattt ttaaaagtaa 
229441 ataaatgata caaggttcta gcgacaactt gctaaaccct tatttttgtt ttaacccagt 
229501 cgttgctaaa cccttatttg attagcggag acatggtcag caggaacaag gttggtagat 
229561 tcacaccttt gatctcagtt tccttagttt gcacataatt tcacagaaaa atccagattt 
229621 tctgattatg gattatacag gagtgtggca aaattcctgg agaaagcgtt tctggagcaa 
229681 agggagaaag gtgaagaaga aaatgcgagt ttgttacaga tttgcagagg aacaactcag 
229741 acggaaagcg caagacgctt ctacgagact ttggatgtca catctggttt ctctgtccgg 
229801 ttcaactttg ttggatttat cggttttcaa gtattggaaa ctctcttaat tgatttgaga 
229861 tgggcactaa cgtgttctag gtaaaaatct ttgccaagac tgatgacttt cagaatcagg 
229921 ctcactttca tgaagattat gcaaggagat ttaaaccaaa gatatttacc gcttctgaat 
229981 cttgccagag caagttgtag ttctttgatt tcgttttaat gaatatattt ttaagtgcag 
230041 ctcaacatat aaaatgtcat atttgttgga tgttgagcat tagatccata tagattttct 
230101 aaaatcgtat ttcatttgtt ttattttata tgtcgtttaa agttatctca cacatattaa 
230161 aaactacatt tttgttataa ccctaactaa tatactattt tgtctgtatt tgataattaa 
230221 taaagtaaga gaatatgaaa aagaaaagaa taggaataaa ctgaataaat atctacattg 
230281 aaaatataaa agacacctaa aatgacacaa aatttgaagt ttagagtgat aataattaat 
230341 actccttctg tatcactata atgtgtttaa ggatttcaat tttattccaa aataagtgat 
230401 gttcttttgg ttctagataa atgtagtttg aaatttgtga ccaattacaa aatactattt 
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230461 tttattggtt aaattaattt aatttaatat tattttatat aaccaaaata aaatattttc 
23 0521 ttaatcttgg tgaaaaacca taaaaacctt ttaaaatgat acaaaggaag tataaaacag 
230581 aggatataat gaaaaaccag acatgaaaaa aatacaattt ataaaaccgc atataaaaaa 
230641 tggaaactca gaattgaaca gagaaccaaa aactatttaa ataatttatg aatctaacag 
230701 tttttatagt gacaactcaa ataggtagag tgagaaatca aattatattc aagacttaat 
230761 tatttttttg ttatttctat ttgacatata ttttatatat taaaacaata aagtttagtt 
230821 ttaaataaat cgaaaaccat attattttat tttgcaatgc agtaatatta tacatgtata 
230881 tatatatata tacttacata tttgagttgg aaactcagaa ttgaacagaa aaccaaaaac 
230941 tatttaaata atttatgaat ctaacagttt ttatagtgac aactcaaata ggtagagtga 
231001 gaaatcaaat tatattcaag acttaattat ttttttgtta tttctatttg acatatattt 
231061 tatatattaa aacaataaag tttagtttta aataaatcaa aaatcatatt attttatttt 
231121 gcaatgcagt aatattatac atgtatatat atatatactt acgtagaata atatattaat 
231181 ttatatttac ttaaaactta tggaaataat ccgggcttag cccggaaaag gttctagtat 
231241 ttctaaaggg tcggcattct cgaagatcgg tctttgaacc aatcacgagg ttgctaaatc 
2313 01 ctaggtgtcc acacaaaaaa acgagttgaa gtcaaagtta ttgataattc aaag~aaagtg 
231361 atttgatttt gaagtttctt cgttgaaaag aaaaataaat aaaaagagag agagaagagt 
231421 gagtgaagga gaagacgaca gacaagtttg cagtttcaat ttaaaggttt ccgcgtacag 
231481 aattaaagta tatcagatct attcacataa cttgttttcg atctgtaccg tttactgatt 
231541 tttagggttt ttggatcttc tgtttagcaa tggcggataa ggccgagaag atgaagctcc 
231601 ggcaagatta ccggaactta tggcactccg atctcatggg aaccgtcacc gccgacactc 
231661 cctgtaaatc tctctctctc tct.ctctctc tctcttttcg atttctctaa atcttaatca 
231721 aagaaattga tttttctttg tgcagattgt tgcttgtcgt gtgtgtggta agattagggt 
231781 ttgttgttgt tgtttctctg tataatttgg tacacagttg atatctcctt ctttgtttgt 
231841 tttattgctt gcttttgtag tggaccgtgt gtttcttact tgcttcggag aagagcactt 
231901 tacaatgaca tgtccaggtc tcaacttttt tttttttatc tatgtacatt attattattg 
231961 gtattgaaga tatttatttc tccctttgct aagatattgt gagtctttat tcttgtaggt 
232021 atacttgctg tgctggatac atgccctgta gtgggaggtg tggagaaagc aaatgccctg 
232081 aactctgcct tgctactgag gtctcacttc cattaataca tatatatcat tacattaaag 
232141 ccgcttgttt ttcggtgaac aaaatgaaac agcttacatt ttcttgatca ttcttttttt 
232201 cccctatctc tgtggcattt gttctaatat gagtactttt aatccctttt agacaattat 
232261 ttcctctttt tctgtgtgtt tgtactctcg tatttctgat tgctttgctc atttcatttt 
232321 gactgagaga cttacattat ctctgagatc catacatacc aatgaactgg attttttttg 
232381 gcttgctggc atgagtagtc aacttgctat tattcaatag tagatctctt gatgattcga 
232441 catttttttt ttaagaaact tgtaattgtg attttttagg ttttcctctg tttcggaaac 
232501 tctgtggcct ctactcgctt tcttctccag gatgaattca acatccagac tacaaaatgc 
232561 gacaattgca ttattgtact ctctttctct ccttctgttt taacttgttt tacacagaat 
232621 gcatatacac atctcattct catccttggc gttttgcagg gatttatgtt ctgcctaagc 
232681 caagtcgctt gcattttctc tattgttgct tgcctcgttg gtagcgagga actctctgaa 
232741 gcttctcaga tactctcttg ctgtgccgat atggtctact gcacgtatgt ttttactctt 
232801 ccttaattct tttcacttta tcctcttagt atattaaagg taaaaccaat gctaatactt 
232861 ttacgaccgt cttggtgtta tttttttttt ttcagggtct gcgcgtgtat gcaggtataa 
232921 gaaagcgtga actactaatc ttttagcttc tgaaatatat tttcttatgt tatatataaa 
232981 ccgaagacca atgttatgct tgcagactca acacaagctt gaaatggaca aaagagatgg 
233 041 agtgtttggt cctcagccaa tgggagttcc accggcacag cagatgtccc gttttgacca 
233101 acctgcccct ccagtcggct accctcctgc gtcttaccca ccggctcaag gctaccctcc 
233161 tgcaccttac ccaccggctc aaggctaccc tcctgcatct tatccgcctc ctggttatcc 
233221 ccaacattga ggaactttaa accaatatct tcatattgtg tgtcaaagtg ttgttgttat 
233281 tactgttcag ctatttttct cagttatgtg gttgctttgc aagtattagt actctggttt 
233341 atgcactaaa acatttggta tcatctggtg tgttgacttt gttggtattt ttctcccatt 
233401 tatctgatga acgcaaaaat gtgtaaacct tgctatgcta tatgtgtggc tgtattcctt 
233461 ttatctttgc attatctgta ctgtgatttc tttgcaattg cctaaccagc ctgttcaagt 
233521 aggctactta acactgtcaa gtagcccaaa acaatatttc ggtattgttt tgggatgctt 
233581 ttcacgatga cctatgacca aaccggtaac atacaagggc ttgccaatta taaaagcaca 
23 3 641 tgcgcaagcc tttctcccga agctgaatag atgagcggaa tactgatcct gaggactagt 
233701 catggcatta ataaccacaa gttggaaact aatgtttctt tcttctttat tttagttata 
233761 tctgttggca ctctgctttg tattcctaac atgagaatcc gagtagtatt ttttattttc 
233821 tcccgaacct gatttgtcta taaatatctg ttctttataa taagatattg cgcacgaggg 
233 881 tgggacgtgg aatcatcgag gatgggctaa attgcttatg aagtttattt aagtcttgta 
233 941 aaaatcttga aaatattttt caaaaacaaa aaaaatcttg aaaatatata acaaagctaa 
234001 tgattcaatt catactgcct tttacgcttt taagtcaaag tgggtcaacc ataataacat 
234061 tggaagagtt gtctatcgat taatttaacg acttctccca ctatatttag aaaattcagt 
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234121 gtagtacgtt tgaagtgtaa agaaaaacta acttattcag gtatatgtag atttatcata 
234181 tttcaaaatt taatcaaggg atgtgctagt atatgctcca tatcgcagga aactggtact 
234241 gataaaataa tagaatctac aagttgatat ttcttcacaa tggtaaggta aaatagtcga 
2.34301 ttcttcaaga tctaaaatta cattagtata ttgttctggt ataacatgtt tagttcgcct 
234361 tttgcaattt agcggcggaa aattggtatt tttttaaaat tatagttaat tatttattta 
234421 tgttacttca tggtttcaca atttgccatt ttttgtgtac ttcatggtat tgtttgaaat 
234481 tttcaacata atgtttttat agattctatt tatttttgtg gtatatctga gagccaaaaa 
234541 atgagtagaa gcgtgtgggg ctcgtattag tttagaaata aataagcaca cgcacagaca 
234601 agtccatgtg tgcgacgtgg cgacagcata acatattcca atttccaaag tttcccaaca 
234661 catgcgaagc ttcttcgcgt cacgctctct gccttcatac tcttttgcag gaacctctca 
234721 aattatattt aataccccaa tgattaagtg ttgattctta agaatgcaaa agattaacaa 
234781 gtctttcagt tcatagaaaa ttctacaaaa ataaaagcaa caatcctatt tcaagtatac 
234841 agcttaccaa gatggactat gattaagctt cgctagaaag tttaatatgt acggtattaa 
234901 tcgtaggccc cagattttag ttcaaacact tacgattaaa aatcgtataa aaacacccaa 
234961 attctttcat atataagagc-gtgaattaca agacaaatag aga tgatgaa aaaaaacctt 
235021 catacaacta tacaaagcga gaattataag aaaattgatg aaaaaagttc ctttacttag 
235081 atacttggaa aatacattgt ttaccaaatt ttttttttca agtttatggt ttttgatgca 
235141 ctttatttgc ctatttttgt ttttttggag aaattatttg cctatattag tccttttaac 
235201 ttttgagtgg agagcatata gcctcttggt ggaggctgcc tccatcgttg attcgttgcg 
235261 ttgtcgttct ttgactttac aaatctttaa cttaactctg agataaaaaa aaaaaataac 
235321 ttaactctga gataagttca tctattttcc gtgaaccaca tgtgtcaaaa ggaaatagct 
235381 tcatggatct tttatcagtg aatagttatg aaaacctcaa aattttcttt tgctatattt 
235441 taaatttaaa aagttaacgc gtgatttaca atttgtagtg agaaactagg tagcacagaa 
235501 gcctactatt tgacactgga tgttttttaa gatttcttga caattttaat aaaataggaa 
235561 atattacaca tttttttaac gctgatttat tatgatatta caattacaat tattaagaac 
235621 attacatata cgattcgaca actgacaata ctcatagttg cctaataaat cgcactctct 
235681 ctgggacttg aaatctggat ttgagaaaat ctgcaataaa ttgcatagtc tgagaatcaa 
235741 accccaaact tgggtgtaga aaactttaga ccttaatcat tatgctaaga tgcttccaca 
235801 gaaatattac acatttatta atttgttaat gagcaagtcc aattatgttt ccatctaatt 
235861 ttttctttct caaaataaag ttgtttttgc ttagataaca aaaaacctgt gaatgaacag 
235921 atttattttc acggtaacag aaaaaagagt gcaaataata actcaaactt tagtttcctt 
235981 ttctgaatag atgatacaag ttttcttctt cctctttcac tagcctgtaa ccactttcta 
236041 tttttaattt acatccatca gctaccaaac acaatactac taacaaaaaa gtattaacaa 
236101 agaaaataac ctttcatgtt accctgtcaa gcgagaggaa aaagaactat gtcctaagat 
236161 gattgagcaa gaagttcaga aacgtcttca ctagattccg accaaagctt ttgactaaag 
236221 tccgtttgaa gctcatcagg tacaaactgg gtacgacaaa gcggacacgt catctgacta 
236281 taatccataa tccaacggtc caagcattct ttatggaaaa tatgcctgca attcgtcagc 
236341 cgtcggatct catcgtcgtt ctcgaagtcg tagagacaca ctgcacaaca ttcggattcg 
236401 ggtcggttta gatccgagaa tctgacaacg ggcaagatct gtccagctag cagcgctgct 
236461 gaaaacaagt gagagtcatg gtggcttgag gtggtggagt ttggtggtgg gtcgggccat 
236521 gaagatgtaa ccggttcggg ttccaagaaa tcgggtaaac caatgaacca gaaaagtgtg 
236581 gagattagtt ttcgtatcaa acctaagaaa gagagtacat gaatgaagat ttttgggagt 
236641 aggagctcgg aatagcccac cggaaaaccc attgcttcgc cgttagagag aaaactgtaa 
236701 agagagagaa atagagagag agacttttgg attgtgatga aaagatggag agggtgggat 
236761 gtatatatag ttggcgtgtg tatgtgtata_ ttaatattgt atatgtgatt tggtaattaa 
236821 ttattgctcc gttaatttaa tggagtactc aataatgtga aaattttcgt aggataatct 
236881 tttatttagt ttcttttcaa taaaaaagtt ccaaaaaaaa gaaaatgata tttttatgaa 
236941 agagtaacta tatatttata tttttagatt aattaattta aactgactgt ttaggggtaa 
237001 aggatgagga ggttttggag gtgtttttgt aaaaaaatat taaaaatagt ttttaaaagt 
237061 attttttaaa agtattttcg aatttataaa aaaaatgaat ttgtttcgaa aagttattaa 
237i21 aaagttcaaa tttgaaaaca cataattcaa atttataaaa acattcttta cttttttatt 
237181 ttttttaatt ttattattat ttatttatat ttatatatct ataaatcaag aggtagaaat 
237241 gtatttagtt ctttaataaa acttattctg gtcatttctt cttttctttt tctttttttt 
237301 ttgtttagat gtcattttct tctttaggaa cttttttatt gacaaaaaac tttaaaaatc 
237361 gttatttgaa atgctttaat ttgtttctgt ttgegtggtt ttctttggta tttaattatt 
237421 ttgcttgatg aggatatagc tacgaatgtg gtaatgatct ttctttaggt tatagtatat 
237481 gcatagacta ttgattgttt ttgttcatac tccctataaa gataatagaa atcaagaaat 
237541 ataaagtatt aatgtaaatc cagaaataca tcaaataatt aaggtaaaga cagatcgtta 
237601 ttcgttaggt tcatttcatt ctcagtttgt cgacacctat attgcattat acatgctatg 
237661 tacatgcatt tatatttata ctttagagat tgtttttggt taccgtgaaa ttaggatctt 
237721 taaaataatt ttttaaaaaa aaaatttacc ttcaaagtag gatattttca catcaatgta 
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237781 gattacaaaa aaaaaataaa gaatgaaaat 
237841 acatcaattc actgtactag aaaatcttaa 
237901 gaaaatatat ttatcatacc aagtgtattt 
237961 ttcagtccaa aacaatgttt gaatcttcac 
238021 acgtcatttg ctattatgat tcttcatcta 
238081 aggtgaacta cgtagaaaag cctttgcagt 
238141 agaatacaat attaatttac gtaaccaaaa 
238201 gtcatatggg tatctgactc aaatgtatga 
238261 accctcccca tcttctcctc caacatttca 
238321 tatactgtat acactcacac acagacctta 
238381 tgtaacttga acaatatttc aatgaattaa 
238441 ttgataaagt agatgctttg tcatactttg 
238501 gtgagaggac aaaaatcacc aggctatttg 
238561 taccacacac tcatgtttat attacatagg 
238621 tcactgtta g t aagattata ttaactgata 
238681 ttttgtctgg acaattaatt taacaaccaa 
238741 . ttattttttc tttcaaatta cttaggaact 
238801 ggcaatttac attcacacat aagggtttag 
238861 tgatgactga tgtgtcatgt gtgtacgtag 
238921 ttttttgtta acatattttc ttcagtctac 
238981 aagtaaataa tttagaaact tcaaatatgt 
239041 agttagtaac tgtgatatat atactctatg 
239101 agtttctctc ggttattttg taagttcata 
239161 tatttgcatt ggtcatctgg cttgaaccaa 
239221 catatgcact gttgtactat gtacatgtaa 
239281 acggatatta ctatagtcat gacagcatgc 
239341 gaacagtaaa aagtacttga ggaattttgc 
239401 ttaacttgaa ccttgtctgc taattcaacg 
239461 tgtcagagac tgtcgttttg tcagcaagcg 
239521 accagctaaa ctataattaa agtatccaaa 
239581 aatcaattca atatatacaa ccaatttcat 
239641 tagatcatta ttgtttgtct attttctcag 
239701 ttggttgaag aaactgaaat ggtgagaagt 
239761 ttttttactg gggaaacacc aactgatcta 
239821 tgtgaatatt cctaattttt gtccttgtta 
239881 taatgaaatt aatccgaaaa tgaaattatt 
239941 aacaagtcac actatgtcaa tgattggtca 
240001 ttttctgaat gcatttattt ttctctttta 
240061 aagaacaact ttatatttga agtttgctga 
240121 attcactagt aaactctaat ttttatttat 
240181 agacaaaaat caaataaaaa cataaaactt 
240241 ttctaagtaa tat ttattaa taaatat tac 
240301 attttaaaat acatgactat tcattattat 
240361 gatcaatcag ttgacatttt agttttatca 
24 0421 agtgcatcag ctaggggaaa attatgatat 
240481 aaactaaact gtaaagaaat gcatgaggat 
240541 atcgatgatc atactacttg tgagtttatc 
240601 gaactcaagc gtaataacaa cctccaaatg 
240661 atactagagg tagagtgata atttatcaac 
240721 aaataccaga aacaacaatc tagaaaatga 
240781 aacaattagt attgtcatgt tttaaacaac 
240841 gttagtgtgc tcgtgttcct tgagaacaaa 
240901 agcaaactgg tcgtgaaatg tattgttagg 
240961 gatcttttat ccttgctttg acacaattag 
241021 aatcttcttt acccaatcga tcctccttct 
241081 tagttcaata tcgatatttc ctataaacac 
241141 actctaccca aatattttct tagagaggac 
241201 tgaagctaat atatttatta acaatgcttt 
241261 tatatacgag aaagaatgta aaattaagct 
241321 attcaacttt atgcaaagtg ttatcagagt 
241381 gactctacta ataccatcta gagtagagta 



tagattctac tctacaaaat caatgaaagg 
ttaatcgtca cattttggca tggacaaggc 
gatatcacaa tgatgaatac acacattgag 
atgactagta gagatcatca gtctacatat 
ttgacttttg atatgtattt ttatgaccta 
tctttaagtt tgattttgat tcgtacttct 
tatttgaaga tctatttcta gagaggggga 
aagagctaaa gcagctctga cccacctcac 
cattccgcca ttcaacactt tttgcaacta 
tatatatata taaggttgtt tattcatcta 
aaaattccca atttaattgg atttatatgt 
gaaaaattga atccaccaca tgtagaaggc 
tctcccacgt acgtttatgt acagcaagtt 
aacaagggct tacagaataa ttaaaaagcg 
agccagataa atccagccaa gattaatcct 
.aattcggtta agaaaaacac ggaaaatcaa 
gcagtgttca aaatttatag tttataccct 
gtaaggtgtc aacaaagttt taatgatcgt 
aattttgatc catttttagc atactattat 
atgcaaatat caa.cia.ctcta aaatgaggaa 
tagaaacttc cggtcctctc gtaacagacg 
gattggatta tgacagccct tagtttctct 
tattcatact tatcaatcta aaaccgtcaa 
tactggagtc actttgtaaa atgttaatat 
agagtttgtt gtatatattg tattttatgt 
tttgcaggat ttaaattatt ataagaaagg 
tttaagtttt aaaagtagag ttaaacatat 
gaaaactaca aagtttattc cagttttgca 
accaactaat cacactaatt gttttaagca 
tatatatact tgtctttgtt tatattcaat 
agttatccag attaacactc acttgactct 
atatattttg gtttagttta. ttcttttttt 
gaggcttgat ccaaggtctg gtgggagagt 
atgaattttt tttttggttc agtatattca 
tatttttaag aaacttttga gctattccga 
ttataactat tgtaattatt ttctcagatg 
tgtagactga cttggaccaa gtggcttttc 
atttctttga atgaatttaa gagcatcttc 
attttatatt tgaggtttga tggtattttt 
tttatatttg tacatcatag tctttacttt 
tataagaact aaaatcatag aataaaacta 
attctactaa agattacaca taaataaaag 
tgttattatt ttcacaatgt tcccacatat 
aatgaacaat tccttgattt _c£taaaataa 
aactcaatta tgtatgctaa acaagtaaag 
agtaaaatgt tattaacaaa caaagaatct 
gatctgaaca agaaacattc acaaagggaa 
aacaaaaatt ttatggaatt ttttttggtg 
acaacaacaa gcaattcaag ctctttaaaa 
gttcgaaact tgaacaagac aacatcatca 
ttcgaaataa taaaatgaac aaaaagatgc 
tggactttga gaaaattcaa atacaattgt 
cttattaggt accaattcaa tcgtatctgt 
aaaccatttt caccaatcgt tgtgattcat 
ttctcagcca aatgagacaa gctcaaccat 
acagaaacac gttgcagtca cacacagtaa 
atcaaaggtt ctcaatctca agtacggttt 
gcttaatatg gataacatca aaacagtggt 
aagctcttgg ttgacccttt gggtagaaat 
atctgcgcct tgaccaaccc taaacacccc 
actaaggatt atcttattat gtagtaaatg 
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241441 ttaatactac ctttgacctc ctctggctca tatttttgca gccttagaat gctgattttc 
241501 cagaactcaa tacgatctca accagctgaa tgttgccgct cagtaaagat gccttgcgag 
241561 atggcaaaaa agagaagcca tccacaagtt taacgcactt cagaccccat ttcaccatga 
241621 ggaattcgat tagatgatgc ccgctgcagt ataaaaatcc atacacaaaa atcaccatcc 
241681 taatcttttg cagatatgtt atatgacaag gatgaatgaa ttgtgtacct cctgttatga 
241741 tacgtggtga taaaaactgc atctggagaa ttttgaagca ggaaagagac ggtggcaaag 
241801 agatcatcga acgctatata caaaacaaag gttcgcgatt cacatttcag. ctcatggtaa 
241861 caagtaatct gggagaagta acgactccac aagggaaaat tagaaggcca aaacaaaaag 
241921 gaaacagatg aactaaccac ttgaatcata tagaacatct gctccaagta taatgttagg 
241981 ccgcagatca aatatggttg catcccacac tccccaagtg agacccatta cctgccaata 
242041 gaccaccata aggtatgttt aaatgaatgg agaaaacacc atgagacaaa tatcttggag 
242101 atacaaacaa acatatacgc tacgctccta agcgttatat atatgcgatt aacgattatt 
242161 gttaatccag cgggcgttga agttgaagaa agtgaaaagt agtaaacgtt acattacagt 
242221 tgagattgtt aagctcacaa actcttgtca tattctccaa aacctgcaaa tgtatcatgt 
242281 atggtgaaag atgagtaata aactcctagt ctctgaacca aatgagaaaa caaagcttat 
242341 caatacattt tcacacctct gttttgcttg agtcatcggt aagggtgaca ttagctccaa 
242401 ccttagcagc tactaaaccg ggtaaagaag tgccagctcc tagctacaaa ttacatcaaa 
242461 cacataaatg tcttaagaaa agaagtatat aaaagtcaaa aaaagacatt cattagctca 
242521 aacacagaac acaatgatga cataatccca cttctttaat gaagaaaagc taaaaaaaac 
242581 aaaacattca ctcaaatctt tgttctcgat ttcaattgag aaagcaggag tggtttgttt 
242641 acctcgagaa ctgaagagcc acgaaatcga gatcgctctt gccagacgta ctcggcgagg 
242701 atgacgctac aaggccaaac gaacaaacca tactcttcct tcatattcta cacacacaaa 
242761 aaatgtcaca agccaacgat gcaaaacgaa aaccatcaga agaagaagga tagcttcata 
242821 cctcgataac agatacagag atggaagttt cggaatctcc tccatcatcg ccatagaaat 
242881 catgctgaga gatggtggta gtagtcattg tatattcttg agattttgaa gttagttttt 
242941 ttttttaatt taagttgagt ataagggagg aacggagaaa cgtcccacat cggtaagaag 
243 001 tgaagttgag gacactattg tttgggtata aaaaggcttt gggtgggtta gagtaaaaac 
243061 atgatccttc aggcaagtga aaggggatat gatgagtggt aaatattctc gctttatgag 
243121 agaagagcgt ttcgctccgc ctgttacgac aattgagcgc tctcctttgt cttaccatct 
243181 gatccctctc ctcattggtc actttttttt tttcataaaa caaattttgt ttattaacta 
243241 taataatagt tattgttttc tttgaaaata cattaatatt tttcaaatat acttattaga 
243301 taattacagt ttatattcgt tttagtaaaa aaatattgga actcttatta tttttagtgt 
243361 ctttttactt gaacaacctc tatttgtctt accatcttag attcaaagct cccacgctca 
243421 ataatgtcga agtcgtggga actgctgctg aattcgaaca ctatctctct tctcacttct 
243481 gatgggtcaa aatctagcaa gccagagcca aatgtaaaac aaatgttctg ttcgacatct 
243 541 gatttgatga agatgagata aaaggaataa gtagatcatt agaaaacgtt ggacgaaaac 
243601 tatctagtta tcaaaaccga acgtgcacat caaagactta agtcaaaatc tttctggagc 
243661 aacctcaact tctttggaat tttgttgcaa gatcagcatt agaaaccaga aggtaggcta 
243721 agacatagca acaagcatga atgaagaata atgaaacgca acagaaaaga gtagttttgt 
243 781 cttaaaaaaa aggagaagga ccataacaaa cggtttgaac aaaaaaaaag accataacaa 
243841 acggaaagtg ctcttattat gctgcatcaa caaagaaaac acaaacaata ataaaactct 
243901 aaaaactcta aaaactcaaa ccagaacctt gaagtttttc tttgatgatc atgtctatct 
243961 ttctccattt cagacgtaag ataatttttc gagtcaccga cttctccttt acggaaaaaa 
244021 aacttgtgct ccacattgtt tattgtccct gttttgttag cctcsaaact gctcagatta 
244081 cgcagagagc agaggtccaa gatcccgtcc acagatccac tatcttcttc ttgcttagta 
244141 aaaggacatc ccaagaagtc cgcaagtctc ttgatctgat cacgaggctc ggctttcatc 
244201 tcctcatacc tcatgaaaag gacatgcttt gggtcttcca agcttcctct ccagtaactc 
244261 aagagatgat cccaaaaagg tccaaaatag ttggttccat cgcagaactc gttaaacata 
244321 gactcgagaa ctcttctggt tggttcgatt ttatatatag cacagctgta aaaccaacac 
244381 gagatcaacg tgtccttcac gttcctgcac acgtacacaa tcttgcaagg agagtgcttg 
244441 agtgcttcgt ggatcgtgtg cagtggcatg tgagtcgaga acagcctcgg aggtgctgag 
244501 aacttggcta ggttaggact tgagctttcg tggtacacat cgatctccaa gaatggtata 
244561 atgccatgag gattatgata taggagagga tgatcagaag agtggttctt tgatctctca 
244621 agcagagcga ctgtgagggc cttgagccaa gtagtgcctg atttggggta cgaagcaatg 
244681 attacatcag tgtcttgcgg ttgaaaacct ctctggaaat tgagacacct tggagggtgt 
244741 tgtaataata ccaacatcct tgatatttac aaatcttatg cccttggtaa tctgtgtgtg 
244801 aaggaagcga agagatcacc ttcctggttt cttcacttat cttgtcgtct ctaaggttcg 
244861 acggaagctc cttcggatcc atttgtgttt cttggttctt gacttcttgt tactcgttac 
244921 aaaaggatat gtttaagttt caaacgaagt tgtaatagat ataagagaag aaattaagcc 
244981 agaaaacaga tattgctggt aataaagacg aaacctaatt tattatttta tcttgaaatc 
245041 ggtggtggtg gtctagtggc gcttgagaga attaaaaaat ccaatacagt. gaattcgatt 
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245101 caaaccccac cggccacatg gatatgggtt attgctttcg actcctttga atgtccagga 
245161 aagaatttat ccgtggactg tacatccatc cggaggttag gtctgtgtct ttaataaact 
245221 cggatttaat tttttttaaa aaaagaaaaa aactaattta tttcgctggt gatacgaaaa 
245281 accttaaggg cttagattca tgttaagaaa atatttaaca tattgaattt cattttctat 
245341 taaaagaaac gaaatattta aatatgaatg ggttaaagtt tggtgcataa tatttaaaat 
245401 aagtggaaca aacattacat atatatgtat actattttta atacaattac ttattttgca 
245461 ttcatttata tattatgaaa taataaaaag gtgatgattg tttgaataat ttttggattt 
245521 tggattttgg tttttgattt ttggcttttt gcttttgatt ttagaatttg gtttttaaat 
245581 tttagtcttg attttttttt tgtagatttt actttttcaa aaaacttgaa tggtaaactt 
245641 agattttggt ttttgttttt ccagactaaa ataagttatt aaaataatat atttctttta 
245701 aaacaataat atacaaatta ttattattgt catccttata aatatattaa aattaaatta 
245761 aatatattat ataaataaaa taatagatat aatttataat aaatataatt ataattatat 
245821 ctccaggaaa atatatagat agttaagatt ttaaaatgga agaaatatca ataacggaat 
245881 tttaaaagta tatttaattc taaaaactat tttaggaaaa aaatataatt gtattttaaa 
245941 ttgtaaagat acataaaatt ttgcaattca aatattttaa cacagaaaaa taataactat 
,246001 atttaaaata aaaaacaaga atattataaa tgtatagtaa ttaaaactaa caaaaattaa 
246061 cttaaattct atagtatagg atattaatta attaactata aaattaatct atctaaacat 
246121 aactccaata acttaattcg tagggatgat tattaatttg ttctattcat atttttgtag 
246181 aataaatgta actctttcaa caaaaataaa tgtatatttt atatagtaga tgtatttctg 
246241 aatgtttctt tttgcaaaat aaaaagttaa aaataaagta tattttaaat aactaatttt 
246301 atttttaaaa attttcaatt tttactatat attgctttca tattatatat gttaagcaaa 
246361 tgattcttac taaactttat atttaattat atatttaatg catgaattta gtaaaaaaat 
246421 atttactaaa aaaagtgaca aatattgaaa accaacgtaa ctttttttta gaaaccacaa 
246481 aaactttttt ttggtttttg tctgaaaaaa cagttattga aaaaaactat ttttttttac 
246541 attttagaga atccattttt ttaaatcttg attgaaaaga atagtttttg gaaaaaaaac 
246601 aaccaaaaac tatttaaaaa ccacaaataa tcatcctcta aataattaag aaacctgtaa 
246661 ctattacgta tataattaaa ttggaatgtg cacataaatc aaattaaaca attttattta 
246721 ctactattta tggtaaaata attaaaaata tttctttatt tgatatggta tataattata 
246781 ttttatataa tattaacaag gagatatatt atattaaaaa aattgatttg ttacatttaa 
246841 ttcttactca tatgatttat aaccatttgt attattgtta taaaataata aaccattgat 
246901 cacaaaatat tcaataaacc attgatcaca aaatattcaa tgtagaactt ttataatttt 
246961 aatatattat aatctaaaaa atttaataaa aatttaaatt taaaatatta atttcttaat 
247021 acttgttcaa tgaaaatttg aaactaaaaa attatgtatt ttatatggtt ttcttatcat 
247081 aatgtatttt atatggtata tagattaatt taaacaattt acaaaaaaat atattttaat 
247141 tagaatatct attaaatgaa acttcatatt tatattgttt tataatcatt tatcttatta 
247201 taaaaaaata ttgatcacaa atctttaatg tggaatttta cagttttagt aatttgtaat 
247261 cttttttaaa aatttgataa catataccaa aaaattaaaa ttttgttata taattattgt 
247321 aattgtttaa tttattttga ctattatatt ttagcgagta ccatgaagtc gaacaaatag 
247381 ttattaaaca ccataaaagt cgtgaattca ttataaatat atggaaactt gtaggttgaa 
247441 acttactgta agaagactat tgaagaccat aaaagtcgtg aaatcattat aaatatatgg 
247501 aaacttgtag gttgaaactt actgtaagaa gactattgaa gttgcttaca tcttgcatat 
247561 ataatatatg gaaacttgta ggttgaaact tactgtaaga agactattga agaccataaa 
247621 agtcgtgaaa tcattataaa tatatggaaa aacttgtagg ttgaaacttg ctgtaagaat 
247681 gaactagacc atgaccttag agatgtcaaa tgggcgggct ataaatgggc ggcccgtgtc 
247741 caaattgata tggtctgaaa tgaacatgcc cattttgatc cagtaactaa attttgtcca 
247801 catggacgag cccaattata acatggacaa tattgggttt aaccacttgg acaatgggcg 
247861 gcccaaaaat ctaaaatatc taggtttagc aaattggaaa aaacacaata tttttttttc 
247921 caacattttc cttcctgaag cctgaaagtt tgatttctaa ctgtttattt gtgatttttg 
247981 ttatggaaaa cataattttc cgctaaaaat gaaaaacgta attttccgct aaaatcgaaa 
248041 aaacgtaatt tttcgttaaa accggaaaaa catagttgcc caccaaaacc ggaaaaatgt 
248101 aatttcctgc caagatcgaa aaatacaatt ttccaccaaa accgaaaaat gcaatttctg 
248161 ccaaaaccga aaaaccgaat ttcctgtcaa aatcaaaaac gcaatttccc atcaataccg 
248221 agaaacataa tttctcgcca aaactaaaaa cgtaattttc tgcaaaaatt gaaaaacgca 
248281 attttccgcc aaaaccgaaa aatacaattt ctcgcaaaaa ccgaaaatgc aattttttgt 
248341 caaaacagaa aaacacaatt tcccgcaaaa atcgaaaaaa tgaaaattat attgtttaca 
248401 ttaaatgggt gtatgggtcc catggacagc ccaacaaatc atggtcttat ttggttatgg 
248461 tctcatttgg acatggttct atttgggctt cgaccaaaaa tgtccaacaa aaaaatgaag 
248521 cccattcgga catgcccaaa cccgcccggc ccgcccaatt gacatctcta catgacctgc 
248581 ccgattgggc ggtgtttatt taatttttga tttacacaaa atgatgtatt tgttaatagt 
248641 tacttgtgat acattcagag gcgtatctac tataggatta ggggttcaat tgacatatgt 
248701 ttaattatga aataattggt tttgcatagg atgccaaaaa gaataagcag ctaaaatggt 
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248761 attaatgtat gtattgacac acctaaatct gagttcacat atctaaatta tacttttgtt 
248821 atcttatttt taaaatttta catatcatga tacatgtaaa aaaaattgat gggtccgcca 
248881 ttggatacat tgagttcttt ttgaattaaa aaaaaacaat caaatctgtg aaaaatattt 
248941 tttattacat agaaagctta aagtgaacaa tgatagatta tatatgaaat atataaatca 
249001 atttagttaa tttaaaatac attgaaataa ttgactttga gtcatatata aatcaattta 
249061 atcgactcat tgttattaca ttttttgtta gggccatagt gttttagatc aaattaagtt 
249121 ggtctattgg ctcatgactc ccctctctta ttctaaccga cagttttgtt ttgtaaccgt 
249181 tcggtgttta gaaaacaatt gtgaagaatg aaaaaaataa atatctcgac tttcggtgtt 
249241 ctcccatgcc gatgacaatt tcaaaactca ttaatcgagg gtctaatgtt tgatagtatt 
249301 tgtgaatctg atgctttttc tttacgaatc aaggtatgga gtaattaaat ctattttgaa 
249361 ataattggtt tatttttcat tttgtttcaa agtatattgt ttgcatgtct agttacacct 
249421 agttaattaa taaatgtcca taacatttat aaagcaaatc aattgtttcc taaaaataat 
249481 gtagttagca agaatatagg caattgcccc attcttaact atttacttaa tcttatattt 
249541 aatattgagt actctgcaag ttcagttaaa cagattttac tggaaagaaa acatagataa 
249601 tttattcatfa agataatttt ttagatttat tgagacctaa ctctatgtga ggtgaataaa 
249661 cattggttta gatgaaaagg gaaattcaat tttatcttga aattatttta atagagaat'g 
249721 gaagttatat tttattttac gacaataaat aatttcgaaa ttaatatcct agatcttttt 
249781 ttttgataac aaataaataa ttttggaatt tatttaatac aagagaagtt aaattttgtt 
249841 ttaggacaac acattaacta aactgaaaaa gagaaacatt aaaatagaga ctatctttaa 
249901 ataaagagag tgataaagac acatttaata aactggttaa aacaagcatt tttttaggaa 
249961 tgttattaat agggtctaat gtttgatagt atttgtgaat ctgatgtttt ttctttacga 
250021 atcaaggtat ggagtaatta aatctctttt gaaataattg gtttattttt cattctattt 
250081 caaagtatat tgtttgcatg tctagttaca cctagttaat taataaatgt tcataacatt 
250141 tataaatcaa atcaattgtt tcctaaaaat aatgtagtta gcaagaatat aagcaattgc 
250201 ctcattctta attatttact taatcttata tttaatattg agtactctgc aagttcagtt 
250261 aaacagattt tactgggaag aaaacataga taatattatt catacgataa ttttttagat 
250321 ttattgagac ctaattctat gtgaggtgaa taaacattgg tttagatgaa aagggaaatt 
250381 taattttatc ttgaaattat tttaatagag aatggaagtt aaattttatt ttacgacaat 
250441 aaataatttc gaaattaata tcctagatct ttttttttga taacaaataa ataattttgg 
250501 aatttattta atacaagaga atttaaattt tgttttagga caacacatta actaaactga 
250561 aaaagacaaa cattaaaata gagactatct ataaataaag agagtgataa agacacattt 
250621 aataaactgg ttaaaacaag catttttaag aatgttatta atagggttag tggcatgtaa 
250681 gtataattaa cattgaaaac ttatgggcat ttcctaagtg tacttctctt ttaataatag 
250741 agatattgaa gttgcttaca tcttgcatat ataatatatg gaaacttgta ggttgaaacc 
250801 tactgtaaga agactattga agttgcttac atcttgcata aatatatata tatatatata 
250861 tatatatata tatatacgag gtaataattc atgatgtggg catagttaaa taattaatta 
250921 gattatttat tttacgttgt attaaaaaac ttgtatagtt ttttttctag agttagatat 
250981 taaaatataa ttcagatatt tacaaatttt tgtttgaatt ttgtttgaat tttttteaaa 
251041 tcattgcagg ttcggttagt aggttcggtt agtgtttgag tctaggtagt catcttagtt 
251101 atgtattttt tttaacaaaa atccaaatat atttaagtct tcaaatttga aaataaaaaa 
251161 atataaaaca taaaagttga ataatgtaac actaaacact taaatttaca taaaattagt 
251221 taaatttaaa catttggata aagaacaaat ggatattttt agtattttga atttttattt 
251281 tagatattta cttgtgattg ctttgataat tttgagatat tttcatattt ttgaatatta 
251341 tagttacaaa ataaaaatta tattatatat agtactgttt gttctctcct agatgcggca 
251401 tgtcatcgcc atgtcagaaa gtcaaaccct gtaatgttga tatgtgtcta gccagagtaa 
251461 aacacaacac ctcatttatt tggagtttcg tactgggctc agttgttaat ttgtattgtt 
251521 cgggcctgta aaaactgata gttgggctat gtatcagtaa gcaataatcg tcctcgttat 
251581 cgacatactc tcgatgaatc caagcgttga acatgataga aacggcttaa tccaagtata 
251641 gtcgcaactt ctttaacggt tttaattcaa tgcacctttc attatttctt ttgatctttt 
251701 gagtgaaact tagactagat aagtctcatc ccatgaaatt catcagttcc attttctcaa 
251761 tctttgtaaa ctttcagcaa taagaatggc taaatctttt ctattcttcg aagatttgac 
251821 ggctgttggt caagtatttg ttaagatact catgaagtat tcgccaccgt ttgagattga 
251881 gccgcatttg agatcggttt tggttgagta aagatgggaa ctgaagattg aacggaagaa 
251941 agcggccttg accgtcaaag atggtgtcga gaggattgta gagtggagga tggggagagc 
2 52 001 catgatggat cagctggagt gttaggtttg gcaagtgagc tcgagacata ttgtgcaggg 
252061 cactagaaag attcaatata tagtgattat ttagtgtatt tattgtatat atgtttttta 
252121 tcttctaatt tgaattatgt ctatattaaa atgtataaat ttacaaaatt tattagtgat 
252181 tactttattt ttaaatctct aactaaaaat taaaggttga taaccccgta aatctttcta 
252241 aaaccatgtc atctgaaaat aaatggtggt gatgatacgt gtacaaattg ctttacaaat 
252301 gattgttagt actgaagctc taaaatgtaa actcgtacca ctaaactaaa tccaaacaag 
252361 tttagcgtga cgattatgtc aaaggttcaa ctctcaacaa tgtataattt gcatttattt 
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252421 ttatttaaat cagttacaaa aaaaaaaaca aaatccaaac accatagata gcatattgtg 
252481 aattctgaat tttggtgtta caagagaact agacttttaa gactagttgg gcttagatta 
252541 aaaaacatgt aagcccacgc atcgaatatt gggaagtggc cacgtcatca gctactttgc 
252601 ttccgacttt ttt ttttctc tgaacaatac tttaattaga actttaacct aaacttataa 
252661 caggaaatac aaagctcaaa agattcagaa actaagcagg catataatat attatctcct 
252721 gagactaaaa gcaaatacat aaacgctcag tcaaaagcca caacactccc ttcataaacc 
252781 aaggcaaagg atcgctggag acaatcaaaa caaagctcta aaagacagga ccaccagaaa 
252841 ttaagaacat ggaaagaccc caacattgta tgacctggcg aggccacaat gatagttgtg 
252901 gtaactaaca attcctacaa gaactatcac gaatcacaaa caaggctaga actttagggg 
252961 caaagagaaa ccaccaggac agatccgcct cacataacgc taccggcaac cgcaattgga 
253 021 cgaaccaaag attgcggata gaacccaggc cggcaaagat cagcaacggc aaagaggagg 
253081 aaactgagag agctttcctt aagatagcat aaccgtaaaa acacgatctt ccacgagcgg 
253141 gagaagcaga acccaggagg tcacacccaa acaccagcaa cgacaatacc atcaagcgat 
253201 acaacagcaa gggcaatcac aagacatggg ttaaaccaaa caccgcgaaa ataagccaaa 
253261 ccctgagcca tagctccaaa gggaaaccaa atctgaaaac tcagaacgca ..aaaacacttc 
253321 tcaatcgcca caaacgagga gctcgaacag taacaaagga agaagctgag gactaaaaac 
253381 cgttgactgt ctaaacggag gcggagccgg agcgctgctt cagacgaacg aagcctagga 
253441 agccgttgct tcaccctaca aaaacagcga cgaaacagag ctacactaga aaaagacaaa 
253501 ctataaatag aaaagagccc agctccggtg ctatggtagc caccggagcc agcgacgaga 
253561 ttcgcttgga tctgggtttc tagagagacg tggagaaaga gagagaaaac tttggaatct 
253621 agataaaatg tcatatgatt gaacgacgtt tccgactttt atcgtcttct tgcctccaag 
253 681 ccttaagctt cagaatatgt gagagagaga gaggtggaag "caaataatca tccaattaca 
253741 acattgtgaa actgtttagt ttgtgaaaaa gcttcctcga tttcaaactc atagctctcg 
253 801 ccagcggaag atcgtcatga tcgtctttgc aaccgcgctc tagtaatcca gattcgattt 
253861 gcttctccgt tctttatccc tttcgaattc gtcccttttc ctgatcccaa tgtgtaaatc 
253 921 ttccaaagaa gctatagatt gtaaggaggg agggatagct tgtgacagca caaatgtcgt 
253981 ggtcgttacc acaccgcctt atgttttcct atttagtttt atttcctatt tatcattatg 
254041 gaaaattatc gtttgttaag ataatgatcc aaagcttttt ccaattcgga ttttagcgtt 
254101 ttatatataa ggctatgcct cgatgatata tcaataataa gaataaagtt tacaaatcat 
254161 cttcctcctt attactttct gtgcaagcaa tacaatatcc ttacaatctc aggttcttag 
254221 agctgatcct tcttcctacg tgatcaactc ctaggtctgt gaatcccctt ccttctaacc 
254281 tgcaagcaag gttactagtt tgggacttgt accccgttat cctatctcat tggtatcaga 
254341 tacatagact cctcaggacc caatcatgga gacaaggtaa accacatcct tgagcgacat 
254401 gagcaagcaa atagacgagc tacgttcttc gcaaaaccag caaactgaag aactcggtag 
254461 caaaatcaac gcactcgaag cactcatcga aaagtacttc gctaacgctc cacccccgca 
254521 acgcgacggt aaacaaacag acgcaagttc tgatattacg gatggaacac cgcaagctaa 
254581 ggctccacca gaccgttcca atccagagaa cagttccttt aaacctcacg acaacaataa 
254641 cccacccatc catcatagcc tatccgcaag gctaacaaag attggctttc caatgtttga 
254701 cggctccgaa ctacgagaat ggacctacgg ctgtgaacag ttcttctcca tcgacagcac 
254761 cccaccggaa ttgaaggttc gtcttgcatc tcttcatatg acgggaaaag cactacaatg 
254821 gcatcactct taccttgcca atcgatacaa catcttttca ttatggccag aatatgttgc 
254881 tgcgatctcc gatcgtttca gtgagcttta cgacgatcca ttagcagagt tggtaagctt 
254941 gaaacaagga aacgatacca tcgatgtgta tctagataaa tttgattgcg ccatgaccag 
255001 aatcacgctt gcgccggatc acgcattgag tatattcttg acaaacatga atcaacatct 
255061 agctcttcac gtgcgccaat tcaaggtcag tacagtacct gaagccgcaa agatagccaa 
255121 actacacgag ctctccctct cacatatgcc aacaaagaca tcgcgccccc cattcaactc 
255181 ttctcaacga tcaaactact cccaacccaa taaaagccaa aaccacaact ccacctcccc 
255241 tactactacc gccaatccaa acaacaaacc cctcattgcg aatgctcctc aaaaatggct 
255301 ttcttttgac gagatgcagg agcgcaaacg taaaggatta tgcatgttct gtgaagaacc 
255361 gtttacacca ggtcaccacc ttaagcataa gcgcgctgaa tttttgttct tggacttaga 
255421 cgctgagaca gaatttgacg acgagattgc actagtggag caaatccgtg agacaaccat 
255481 aagcgacgat gatgacaaag ttccaactat ctccgtccac gccctcaacg gtgcgccaac 
255541 tttcaactgt atgcgcctcg tcaggaaata cgagaaacgt aaactacata tactgattga 
255601 tccagggagc acacacaact tcttggacat ccagatggct aagggattag gttgttcttt 
255661 gacaccaatc aaaccaatgt cagttgttgc agcaagtggc gacttggtta ctaagtacaa 
255721 gtgcagctct tttgcttgga agatgcaagg ctacgggttc acagctgaga ttcgaacctt 
255781 accactagga tgcagtgatc tcgtcctggg ggttcaatgg ctttccacct taggaccaat 
255841 cctatgggat ttcctaaacc ttcgtatgga gttcaaattc aacgagctaa aacatgtctt 
255901 acgtggaata tcacccaaca gctcaaaatt aatttctgga agcagcttta acaaactgat 
255961 gttgcaagat ccacagcttg ctctactcca tcttcgcgag attgacgaaa ctacagagca 
256021 agaacccctt gaaccggaaa caattttctg tcatattgaa gcgagtgaaa cagagaacga 
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256081 caattccggt tcactcgaga gactccttga 
256141 gaccttacct ccttatcgcg cgggtttcaa 
256201 tcctgtgaat ctccgaccat atcgatactc 
256261 gattcaagac atgctttctc aaggtatcat 
256321 tattgtactt gtgaaaaaga aagacgggtc 
256381 caataagcaa acgatcaaag acaaataccc 
256441 attgggcggc tcaaagtact tctctaaact 
256501 tatgtctcca gaagatgtgc acaagacagc 
256561 tttggtaatg ccattcggcc tcacaaacgc 
256621 cgtattcgca cccgttctac gaaagtttct 
256681 cagcaagacc tgggaggagc acctggatca 
256741 tcaacaactc tatctcaaga aatcaaagtg 
256801 tggtcacttt atttcccatg atggcgttag 
256861 ggaatggcca caaccgaagc accagaaaca 
256921 ctatcgaaga ttcatacaag ggtacagcat 
2 56 981 caaagacggt— tttgcttgga "atacagaggc 
257041 attaatctcg gccccggtcc ttgcactccc 
257101 cgacgcttcc aacactggca taggcgcaat 
257161 cataagtcgt gcattagggc ctcgacacca 
257221 tgctgtggtc cacgcagtac agacttggaa 
257281 tcggacagat cagaaaagtt tgaagttcct 
257341 catatgtggc tatcaaaatt aatgggttat 
257401 gaaaacgtcg ctgcagatgc tttatctaga 
257461 ctctctctca tgctcaccac gggttctatg 
257521 ctactctgaa acagatcatt accgacctgc 
257581 cctttgtcaa cggagaactt cgccgccggg 
257641 tcaaacttca tatcttcaag tggcttcatg 
257701 atgcaacact acaccgcatc aactcgttgt 
257761 aaaactatgt ccgcaactgt ccagtttgcc 
257821 ctggactcct ctagccgtta ccggttccta 
257881 ttatcgaggg tctaccacca tcatcaggaa 
257941 taagcaagaa tgctcacttt ctcgccttgt. 
258001 agctatatat ggatcaggtc tttcgtcttc 
258061 gtgacccaac gtttctcagc gaggtttgga 
258121 aaacttctcc accgcttacc atccacaaac 
258181 tttggaaacc tatctccgat gcatgacttc 
258241 acccttagca gaatggtggt ataatactac 
258301 cgagatcatc tatggccagc cacctccagt 
258361 ttctaccacg gttgacaggt ccttacaacg 
258421 ccacctcttg cgagctcaga acaggatgaa 
258481 tgcgtttcat attggtgact atgtctattt 
258541 caaaggacga catttgcctc acaagctttc 
258601 agatcgtgtt ggtaacttgg catataagct 
258661 tgtcttccat gttagtcagc taaagctcgg 
258721 tccccaatac ctcaaagatg tcggcactgc 
258781 aatggtgaat cgccgaaaca gagctgtcac 
258841 tccggaacaa gccacatggg agttctatca 
2 58 901 tacttgaggc caagtatcat ctgaaggagg 
258961 gttaccacac cgccttatgt tttcctattt 
259021 attatcgttt gttaagataa tgatccaaag 
259081 atataaggct atgcctcgat gatatatcaa 
259141 ctctttatta ctttctgtgc aagcaataca 
259201 gatccttctt cctacgtgat caactcctag 
259261 agcaaggtta ctagtttggg acttgtaccc 
259321 cattctgttc tcggaagtta aggtcatgta 
259381 ttcctgaaag ttccatcaag aataattgtt 
259441 attttggtca catgagcatg atgaacataa 
259501 cagaggttaa gaagttgatt ctggtaacat 
259561 aaacacttat ggatgccaaa caagcaaact 
259621 aggagtgaag gccttctcta tgaatcatat 
259681 ggtgtctgac tcacgccatt cccatcagcg 



ttcatacacg gacgtatttg atgagccgtc 
tcacaaaata ccattggaag caggatcaaa 
ttcgatacaa aaggattcaa tagacaagat 
tcagtacagt gcaagtccct acgcttcacc 
ttggcggctt tgtgtcgact acagaggtct 
catcccacta ctcgaagatc tccttgatga 
ggatttacgt gcaggtttcc accagctttg 
ttttaaaaca cattcaggcc actacgaata 
accttgcacg tttcaggggc ttatgaatca 
ccttgttttc ttcgacgata tattaatcta 
cttggacaag gttcttgcta tacttcgcca 
tacttttgga ggaacgagaa tcgaatacct 
tactgaccca accaagataa aggcagtcga 
cctccgcagc ttcttaggtc tagccaatta 
"tatcgctcga cccct-tacca tcatgcttcg 
ttcagacgcg ttccatctcc tcaaacaagc 
cgatttctcc aagactttca tcgtcgaaac 
tcttatgcag gacaatcacc cagtgtgcta 
aggcctttct gtttacgaga aggaactcct 
cccttacttg gctcacaaca agttcattat 
actagaacag agatcaccac tacattccaa 
aactttgaga ttcagtataa agaaggcaag 
gtctccggtt cccaactact tcagctctct 
attctttgag gatgctctgg gatacatacc 
aaaacaaacc tgggtctcac gccgcgtact 
gtaaacttgt ggttggtaac gaccctgcta 
attctgctgt aggtggtcac tcgggtcgag 
tcttttggcc aaagatgagc ttggaggttc 
agaaaaataa atatgattta gctgcgaaac 
cgggtgtatg ggagtctgtg agccttgact 
aacattgcat tctcgtcgtt attgatcgtt 
ctcatccgta cactgcaatg gacgtggcaa 
atgggatgcc taaagacatt acaagcgatc 
gagagatgtt ccgcgttcat gtgtcgattt 
tgatggtcaa accaaggtta ctaacaaaac 
tgattcacct tctacatgga gcgcgtggtt 
ttaccacact gcgattcgca gttcaccgtt 
gcacttgcct taccttcccg gcgaaagtac 
aa gggaagag ctcattgaca tgatgaagtt 
acaatacgca gactcgcacc gttctgaacg 
gaagctccag ccgtatcgac aacattccct 
accacgtttt tatggtccgt acgagataca 
acgtcttcca tttgaagcag ctatccataa 
ccccaacccg cctgcaactc cttcgtcgct 
aaaggaacca gaaaagattc ttgaaacaaa 
gaaggtacta gtacagtgga aaggctactc 
agacttcgtc gcgaaacatc ctgatttcaa 
aagtattgtg acagcacaaa tgtcatggtc 
agttttattt cctatttatc attatggaaa 
ctttttccaa ttcggatttt agcgttttat 
taataagaat caagtttaca aatcatcttc 
atatccttac gatctcaggt tcttagagct 
gtctgtgaat ccccttcctt ctaacctgca 
cgttatccta tctcatagct ttatgaccta 
tgtttcatgc catcagaaga attgtttttt 
gttaccctct taattgaaat ggataaattg 
acaatcacat aaactagaac ttagaaaatg 
gatcatgaat ctattgacga acataaataa 
cttgattata agaggtgtct agaagattgg 
tagcatgagc aatcggccta gcttcatttt 
gttcaaagtc atcttggcga aaaacctttt 
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259741 gatgagatat agaaaatcag tttaattaaa cacaaaccaa atttgaatga cttctccagc 
259801 aatgttaaaa aagtaaagat gttgccatga catacgcagg cgtatgttga caaaccgacc 
259861 aaagagtctt ttaaaatttg acaacagtta aaaacggatt ctacagctaa tatatgcatt 
259921 aggtataaac atcataaacc tgattcaggt ttatgattca gctaatatat gcagcaatcc 
259981 ggtaatgaaa caaaatatta atggttcatt agttagaaaa agattatcac gtgattaaca 
260041 aagacttgtg tgatccttag tccaaaaaaa aaaaaaaaca aagacttatg cgacgacgac 
260101 gacaaacaaa aaaaacaaag acttttgcgt ctagttgggc ttaaataata aacatatagg 
260161 cccacgcaac cgaatattgg gaagtggaca cgtcattagc taccttgctt ccttccgacc 
260221 tttatcgtct tctgccccca aacctcgaga gagagagaat agagatcgaa gcaaagaatc 
260281 atcatcgaat tacaacattc tgaaattctc accttcttcc gatccacaga ttattccttc 
260341 actgctctct cagtgtctaa taatccttgg aacgcttcct cgatgtcaaa ctcatagctc 
260401 tcgtcagcgt cacgatcgtc tctttgcaat cgcgctctta gtaatccatc ccagattcga 
260461 ttgctaccat tctctatccc ttttcctgat cccagtattt aattttcatc cgcggctgaa 
260521_aactccaaac attaggggaa gatgattcat tggggtggag taacctgctg cctcagcgcc 
260581 gccgctcttt atcttct egg ccggagtagt ggcagggacg ctgaagtact caaaaccgtc 
260641 actagggtta accaactcaa ggagctaggt catcttctcc cccaaattgc ttcatctact 
260701 ctctctcata tatattattt ttgtctgatt cttattaata cttgtttttc ggatttacta 
260761 tgtattttgc agegcaattg ctagaattag atagcagcaa gctccttcct ttcategtag 
260821 ccgtttcagg aagagttggc tctgacactc ctatcaagtg cgagcatagt ggcatacgcg 
260881 gcgttatcgt cgaggaaacg gtacttatta atctagtttg tgtttcccgt tttgtcagtt 
260941 aggtgtatct atgeaaatea tatatagttg atgtgtttca teaggeggaa caacattttc 
261001 tgaaacacaa tgagactggt tcttgggtac aagatagtgc acttatgeta tctatgagca 
261061 aggaggttcc ttggttcctg gtatgtcaaa atatcaagtt tgcttactgt ttggatgttt 
261121 ttttttgaaa aacaatctct cgaatttcga aattgtttgt gaaacaggac gatgggacaa 
261181 gtcgtgtgaa tgtagtggga gctcgtggtg caacaggttt tgetttgace gtcggaagtg 
261241 aagtttttga agagtcaggg eggtctcttg tacggggaac acttgattat ctccaaggcc 
261301 ttaaggtatt tttgetttte tattcggttg cttgtttgtt gttattgacc tctctatttg 
261361 etttaagegg gtatcgtttt ggtttcagat gcttggagtt aagcgcattg agcgtgttct 
261421 tcctactgga atgcctctca caattgttgg tgaggtatgt egtattctea gtgtcttctt 
261481 ttttgtttaa aatttcttaa gaatagtatt tgtagcaagc tggattgatc tttctgeaga 
261541 ggcagactga tgagtttgat ttaatttctg ttgettgeat gtattgeagg ctgtcaagga 
261601 cgatattggg gacctaagga ttcagaaacc tgaaagaggg cctttctacg tctctcctaa 
261661 atcactcgat cagctcattt ctaatctggg gaaatggtca aggtcctctc tctctctcct 
261721 ctctcacttg gttcttcttc tectatgeta tttcagaaaa agacgatgag ccaaactgtt 
261781 ttttacaggt tgtacaagta tgcctccatg ggtttaactg ttttcggtgt gtttctaatt 
261841 acaaagcatg tcattgattt tcttctagag agaagacagc ggcgagaaat acagaaaagg 
261901 tatgttaaac atttgtctgt etataaageg aataattgtc ctcaactaga tcagttttgc 
261961 ttccctcgga aattatatac acaaaaatat taagcagtgt cagctgttgt tgcagagtgc 
262021 ttgatgeage agctaagaga gctgggactg aaggtatcca ttgggtgaaa tctttttatt 
262081 ttccactcat aaattggtag cacagtttct gaccacaaaa cttttctctc accaggttca 
262141 aacggcgcac atgagagcgt atcagattct accaagaatg aaggcgctgt tcctgatctc 
262201 tgtgtgatct gecttgagea gaactacaat gctgtgtttg tcccgtaagc attcttctgc 
262261 ttcattttgg ctgtttttta catttgeaaa ttgctaaaaa gagcgtgtgt tgttgttact 
262321 tetegcaggt gtggtcatat gtgctgctgc accgcatgct cctcccattt gtccagctgt 
262381 ccactttgtc ggagacgtat_jagatcaggtg gttaagacat ategtcactg agcaaaactc 
262441 aactcagggc tcagaagcat tctctactcg agttgcctgt aaatactgea agatcaaaac 
262501 gatactaaat tggtgttcgt tatcctctct tgtttgcctt caagaacaat acatttgaac 
262561 tatataaaaa cataaggtgt ttgcagtaag ttcgatttaa ttaatcttgc cgctgccaat 
262621 gacattaggt gtttgeaget taacctgttt gaatcttcct tgcttcactc ttgattgact 
262681 atatctgtct ttttccccaa aacaagcttt aagtgtttct cacaaaatgg gagatactaa 
262741 agcagagaag ccagctccaa caatctgtac gattctcagc tttgccattg ttgtcttctt 
262801 tcgtgtagat tattccttca gtaatattag tgttgagaga agagattgag tcgctttgtt 
262861 ggcttgatca aacccaagcc ttcttatctc tttttccacc gtaaacacat gggcaccaca 
262921 ttcaacaatg ttgagttcgt ggtaggtact gatgaattcg aacatcatct cattctcctg 
262981 ttcaagecat ccatgttctt tagcatataa atagatgtcc cgcttggatg catagacatg 
263041 ttcctgtcag tgaagegace ctagtttatg cgcacgcgac acaaaaatct gctagctact 
263101 tcaatctgtg tggtttaggg gcaaccactg aaccacgatg caaaacctaa attctctgga 
263161 atctagtaca taaaatagtt tatatatatg etatgeeggt ttttgtgtta caaaaagaaa 
263221 ttactatgtc aaatgtgaat catgtcaatc gtataacttc caaatcatgg ctggatttta 
263281 aacggttcaa tattctatat aattttcaat caacaaattt gtttctttta tcaccttcat 
263341 attctatgtt ttgttatcag tatcattggc atctcatcca tctttcattt gtttcaggtt 
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263401 atcaagttgg atatcggcca tatcgaccca tcgatcacat ctaagggcgg tcggttagaa 
263461 cttagaacct cggaaaccaa tctggtcgac ccagattctg agacagagat aatcagtatg 
263521 acagatagat cctaactagg agctaaccaa tagctctaag aaaggtgaat actagtctag 
263581 aaggggactg tgagttttag -aatagtcacc tcgaagatgc tagatcctta ggatataaga 
263641 acagagagtt ttagaggttg gataaaatga tttctgatag atctctgcaa atgcaggaga 
263701 tgagtacaaa tactaaagtt cgagggtcac actaacggtc taaaacaatg acctgttaat 
263761 gacgacaaaa gcataaagta aaagcataaa gcaaaaggtg aaatgagctg gatgtatttg 
263821 ctgactacgc ctttttgtga tcaggttgtt aggcgtgata ctcctgtgac aatactccct 
263881 tcttcagagc aaccttgtcc tcaaggattg aaaagaggat acttgttggt gaagtccttg 
263941 tagaactccc aagttgccat ttcaggtgat tcatctttcc agtggactaa aacttttgtt 
264001 gcagcttcgt tttggcgctt aacagtctta gtctccagaa ttgctgcagg ttctttggtg 
264061 ttgccgaggt caagccagta ctgaggaacc gcaggggagg atgaagcagg gtttgggcac 
264121 agctttagtt gactcacgtg aaatgtgtca tgaattgcag cttcagtagg cagcgagagt 
^ 264181 ttgtaggcaa ccttgccgat acggtctagc acacggaagg gaccataaaa gcgtggcgag 
264241 agcttgtgtg gagccttatt ctttttcact gtatgctggc gatatggctg gagcttcaag 
264301 tatacgtaat caccgatctt gaactctcgc tgggaacgct ttgcatcggc atactgccgc 
264361 atacggttct gagcgcgaag caagtgaaat ttaagcatgt taatcacttc ctcacgcttc 
264421 tggaggctgc ggtctacaac aacggaggag ctttcacccg gaaggtaagg gaggtgaagt 
264481 ggtgggggtt gaccgtagat gacctcgtat ggagtagctt gaatggcaga gtgaaaagtg 
264541 gtgttatacc accattctgc aagactgagc catttgctcc aggatgttgg tgtctcagca 
264601 gccatacagc gtagataggt ctccaaggtc ttgttggtta, cctctgtttg accgtcggtt 
264661 tgagggtggt aagcagtgga gcgcttgaga gtaacccctt gtacacggaa gagctcgttc 
264721 caaacctcac tgataaagat cggatctctg tcactgatga catctttggg catgccatgg 
264781 agtttgaaga tgttgtccaa gaatgcctga gcaacagtga gggccgtgta tgggtgggac 
264841 aagggtataa agtgtgcgtt cttgctcatc ctatcgacca ctaccataat acagtgtttg 
264901 ccctgagacg gcgggagacc ttcaatgaag tctaggctta tagattccca gatcccatca 
264961 ggaattggta gaggttgtag aagacccggt ttggcagcca tatcgtactt gttctgttgg 
265021 cagatactac agtttcgaat gtagttctgg acctccacgt ttagcttagg ccaatagaac 
265081 aaagatctga tacgatgtaa ggttgcgtca cgtccagagt gtcccccaat ggcagagtcg 
265141 tgaagccatt tgaagatgtg caacttgata tctttatcgt ttccaacgac tagcttgcct 
265201 cgacgtctca gctctccgtt tgtgtatgtg aagagggggt gtgaagagct gttagatttc 
265261 aagtcgcaga taatcttctg gagagtagca tcggtttccc agagtagctt caacgaatcg 
265321 tagaaaccag tatgagcctg agatagtacc atgttcagta actgagagcc agagactcga 
265381 gagagagcgt cagcaacaac attgtctttc ccttgcttgt agtggatctc gaagttgtag 
265441 cccataagct tagagagcca catgtgctgg aaaggtgtag ttattttctg ctccatgagg 
265501 tacttgaggc ttcgctgatc agtgttgatg atgaaagggc gatgcgcaag gtaggggtgc 
265561 caggtttgca ctgcgtgcac taatgccatc aactcctttt catacacaga caggttttgg 
265621 tggcggggac caagggagcg acttatgaag cagatagggt gttttccctg cataaggacc 
265681 gcaccaatac cggtgtttga cgcgtctgtc tcaacaatga aaggttgatc aaagtcgggg 
2 65741 agggctaaga caggcgtgga actgagcgct gctttgaggt ctgagaagga ttgatcagct 
265801 tcaagggacc aagtaaagcc atcttttttc agtagcgtgc tgagggggcg agcgatggag 
265861 ctataacctt tgatgaagcg gcggtaatag tttgccaatc ccaagaagct ccgtaggtgc 
265921 ttctgtgtcg tagggattgg ccaatcacgt attacagcaa tctttcgggg atcggtgctc 
265981 acgccgtcag cagaaataaa gtggccaagg tactctatga tcgttgcacc aatagtacat 
266041 ttagagagtt tgagatacag ctgttgttgt tgcaggatca ggaagacctc cgcaaggtgt 
266101 tgtaggtgtt cctcccaagt cttgctgtat acaaggatgt cgtcgaaaaa gacaagcact 
266161 gatttccgag agaggtcttg aaatacatga ttcataagac tctggaaagt gcaaggggcg 
266221 tttgtcaagc cgaaaggcat gacaaggtat tcatagtgcc cttgatgtgt cttgaatgct 
266281 gtcttgtaga cgtcatcctc cgacattcgg agttggtgaa aacctgctcg aagatccaac 
266341 ttagagaagt acttggcgcc accaagttca tcgaggaggt cttctagcag agggatcaga 
266401 tacttgtcct tgatcgtctg cttgttgagc cctctgtaat ccacacaaag acgccacgtg 
111^ c f atcttttt tcttgaccaa gacgatgggt gaggcgtagg ggctagcact gtgctgaatg 
266521 atcccttgag tcagcatttc cttgatcatc gtatcgattg tatctttttg aagagaagag 
2 66581 taccgatacg ggcgtaagtt gacggggttg gagcccgcaa ggagtggtat ctggtgatca 
266641 aaaccctctc taaacggtgg cagtgactta ggttcttgaa agatatcttc aaacgtttga 
266701 agaagttgtt ggagagctgg gtcgtcagca tttgttgttc ccgaagcaga gatgtgagag 
offZnn* ^ acaacattt caggattgag gctttgctgg ttgtcggcat tctcatcaat ctgttgtagc 
266821 tggattaagg ctatttgagg ttcttggagc atgagcttgt tcaagctgga gcccttaata 
lAll^ accttgcctc ctgtcttaac gacgccacga agtacatgtt tggtgccaga gagtgtgaac 
lllrll tccattct 9 a ggttgaggaa gtcccaaaga ataggtccca aagtgcaaag ccattgcact 
267001 ccaagcacaa aatcgcagca gtcaagaggg acagtgcgga tttccgtcgt aaaggaggag 
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267061 ccttgaacgg tccaagtgaa agccgagcac ttgaagtttg tgagaagggt gtcaccagtt 
267121 gcagctttca ctgacattgg gcgcgttggt tcgagctcac aacccagttg agtggcgatg 
267181 ttgaggtcca caaagttgtg ggtacttcca gggtctacaa gtatatgcag cttgtgtttg 
267241 ccgtgatgac caatcaagcg catacagttg taggaggtgg agccgtttaa ggcattgatg 
267301 gagatgactg gtgtgacttc aacagtcgct tgcttgtcgt ctgactctgg ctcggcatca 
267361 gaggagctat cgtcagggat ggtatcagca tcatcacact ccatgacata gatctgagag 
267421 cgcttgtgtt ttagttgatg gccaggagta aatggttcat cacaaaacat acataagcct 
267481 ttcgatcttc tgtcttgcat ttcctggtag gagaactttc gaggtggttt gtcagtgggg 
267541 tttcggggaa taaaggaggg tttttggttg gtgggggctt. gggttagagg taggaggggg 
267601 gaagggttgt ttggtttttg gtagggtttt gtgtttgggt aaggattgaa gggggctctt 
267661 tgtcggtttg gggtatggga gagagaggat tcatggagca tagcaatctt tgcagcgccc 
267721 gcaatggagg ttgtttcaaa ctgtctcgtg tggagagaga gatgagggtt catgttagcc 
267781 aggaaaatgc taagagcatg agcctcagga aggacaaggc gcatccttgt cgtctcaaac 
267841 ttgtcgagat aggcagcaac caaatcggag ccttgcttga gggccactag ctcagcaaga 
2 67901 gggtcgtcaa agagttcaca aaaacgagca gatatggcga ttatatagtc tgtccaagag 
267961 gggaacatgc caaacc.tagt gctcatgtag tttgtgtgcc attgagtagc tcttccggtg 
268021 agatgcatcg cagctaagcg taccttcagc tcctgtgagg tgccgtcgat atcaaagaac 
268081 tgctcacact tggaaagcca gtcgcggagg . tcagtgccat cgaactttgg gaacgagatt 
268141 ttggaaaggc gagatgtgag tctgccaaac tcgttgtgtt gacggaatcc gttgagatct 
268201 ggggggtcag gtggttggga agaaccgagg ttgaagggtg tcggagggtg ggaggggccg 
268261 gggtctagtg gggcttttcc ggtggcgtgg agcggtgagg cgttggcgga gaacatcagg 
268321 gcctcgagtc tgtcgaaacg agcgtcgagg gagtcggaac gagacttaag ctcagcagcg 
268381 aggaggtcgt gtaaggaccg taactcgtca acttgctcag tcagagatct ctcttggagt 
268441 cgtgtttcaa ccatggtttc ggtaccgagg aaaggtgact tctgatacca atgagacaga 
268501 gataatcagt atgacagata gatcctaact aggagctaac caatagctct aagaaaggtg 
268561 aatactagtc tagaagggga ctgtgagttt tagaatagtc acctcgaaga tgctagatcc 
268621 ttaggatata agaacagaga gttttagagg ttggataaaa tgatttctga tagatctctg 
268681 caaatgcagg agatgagtac aaatactaaa gttcgagggt cacactaacg gtctaaaaca 
268741 atgacctgtt aatgacgaca aaagcataaa gtaaaagcat aaagcaaaag gtgaaatgag 
268801 ctggatgtat ttgctgacta cgcctttttg tgatcaggtt gttaggcgtg atactcctgt 
268861 gacagattcc agcctgtatt acttcgggat cttttttttc tcaaattcct ctatcgtccg 
268921 ttgagattca cttcttcggc ggcactgagc tcgaaattgt cgtctgcttt ggcggtactg 
268981 ttcactcaat tcgtattcgt cctcactcca ttcgcgctcc tcactcgatt catacggtgt 
269041 cgcctgtaaa ggattcgctg ggttcaactc aaggctctcg tcgatcagac tattaatatc 
269101 gtcttcacat gattcgtacc tcccatcagt gaggtaatca atgtggtctt cagacacttg 
269161 gtctgatcca gattcatagc tcccttggat gctcttgtct tcaaacactt tggcaccaca 
269221 ttcaataacg tcgaattcgt gagatttgct gctgaattcg aacactgtct ctctgctgac 
269281 ttcggagggg acgatgaatg gcaagcgagg gtgaaatata agtagatgtt cagatcgata 
269341 tctggagaca tagcttacgc aaaactcctc aacaacgggg tctaggtcgc ctttgcttat 
269401 gcagcggcac actaagtatt ctgaatgtct actttttttg ctgatttgat ggttaggtga 
269461 caccacaacg caaagcttca atatagatac agcagaaaga ggattgttac catagggaat 
269521 tctaatggtc aatgaacttc ctctggctcg gtgatcaaac ttggcaggta ctttttttcc 
269581 tggtaagata acccattcac aagacaagga acggtggata atttctcttc gtgcttgttg 
269641 gcctaatttg aagcagttgg tgaaatcgag tactgcattt ggactgctat tcaaagggca 
269701 aaacacggtc tccagtgatt cgcaatcttc cgcatttaag gatttgatcg aaccagggag 
269761 ctctggcaat gatgcgagtc ttctacatcc atcaagggag agttgctcta gctgatgaag 
269821 atctttgatg caatctggaa tcgtctcaat atcagaatac cgtaggtcta gacatattag 
269881 actcttgtgg agatgtgtta atcccttgag tttcccactt ttccttatac gagagtacat 
269941 accagatgat ggcggccgtt gtcgaaaaga gtccccacat atactcaatc tcggtgaagt 
270001 agttgaaata aacagaaact ttctcaattt tgaacatccg ttcatttcga caattccaag 
270061 agaagccaag ttcaggtggt ctggaatgac ttgtaggttt acgcaatcat ccatccacca 
270121 caattctagt ttctgaagat gcataaaaga ggatggaatc tctaccaaac tcttgcaata 
270181 actcatatcc aatatcttca gatttgttgc tcttgaaaga tccgggagtt ccttcaaotg 
270241 gcatgaaaat gacagatcaa gcttcttgag atttgcaagc cgctgaattt gagaaccaca 
270301 caacaacaga gggacatcat ataacatgaa gagaagataa ttcatatata tatatatata 
270361 tatattaagt acatacacca acctgggttc ctttccagag gtgctcgaat ttgccatgct 
270421 tcatatcaag ttccacgaga tattcaggat gaaaggaagg aggaagacac ttgttagggt 
270481 atgcattcca atgtagaaac cttagacggc gtggaaactt catctcctca ggtatatgta 
270541 ttacatcatt catatcatct ctgctttttt ccccagcaag aatgtcaaat tttagtttgt 
270601 catctctgct tttgtagact ttgagaaact gaagattata cattctttta aaagctcttt 
270661 cgtcaacaat caggtcgttg attcctgata tatcaaatga tatgccggac acagctctag 
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270721 ttccctaaac caaattacag ataaaaaata aaaggttagg tccctatgtc agattttgcc 
270781 tagaatgact ggtctgatat aaaaataatt accaaggaat caaacagata tatatatata 
270841 tataaaaaag tggaaaagaa tttactgttt cgttttcgag gacatcgcaa atctcatgcg 
270901 catccagtaa gatttggcgt ^ttccaaggct cttgtttatg aatggcttgt ctacccactc 
270961 gtcgtagtag cttgtgcatc tcgattatat tcccatatgt agatatctct atgagagatc 
271021 tatagactag gatctgcaaa ccgcgtttga catctattgt accgtcagag aacatggctt 
271081 tcacaagatc actatcttca ttgttgaaga agactgcaat gtggagaaat agggtttgct 
271141 cattctcctc taaactttca tagcccactt ttagcacgtc ctctatacct tgatcaagaa 
271201 tagtttccag tctgttcact acatcttccc attcgtcctt cttcttccca cgtaaagatt 
271261 tacccaccac acatagagcc aatggaagat taccacaaag ccttattatc cttgtacaaa 
271321 gcacctcaaa accataaagt ggagacctta gtctaaaagc acatctacat aagatctcga 
271381 tagcttcttc atcaaatgga aaccccacat ggtaagtgct accgatacca tgttgctgca 
271441 aaatctctag gttttctgtg gtcactacaa tcctactccc aggaccaaac catgcactct 
271501 _ cattagccaa cgcctctaat tgctttagac tgttcacgtc atcaagaatg ataagcactt 
2715*61" tctggtcgca taggttttct tttactgcac "ctaaatggca tatcctcatg tccttgtggt 
271621 tcaaaacctt _tgaaagaaaa ttttcttgta accttttctt ccaaccatat tcgtcaagac 
271681 cactaggaca gatttcccta aggttgtcca caaaacaagt aagctggaac ctacgggaga 
271741 gtagactatg taaagctcga gctatggtag tcttaccgat ccctgcagga ccagtgatgg 
271801 caaccatctt aactccatcg taatccagat ctaacaaaga ctctatttcc tttagatgag 
271861 catcaagtcc cacaatgcct tcaaaatctc tacacggtgt tgcattcagt ttttgtaaaa 
271921 catcttttgc aatcttcgca atcattttag cttcattgac ccttagataa aaaaaaaaaa 
271981 acaatagaga 
(SEQ ID NO: 87) 



Gene 2 6 Product 

MLARVCGFKCSSSPAESAARLFCTRSIRDTIiAKASGESCEAGFGGESLKL 
QSGFHEI KGLEDAIDLFSDMLRSRPLPS VVTDFCKXjMGVVVRMERPDLVI S 
LYQKMERKQIRCDIYSFNILIKCFCSCSKIjPFALSTFGKITKLGLHPDW 

tfttllhglcvedrvsealdffhqmfettcrpnvvtfttlmnglcregri 

VEAVALLDRMMEDGLQPTQITYGTIVDGMCKKGDTVSAIjNLLRKMEEVSH 
IIPNWIYSAIIDSLCKDGRHSDAQNIjFTEMQEKGIFPDLFTYNSMIVGF 
CSSGRWSDAEQIiLQEMliERKISPDWTYNALINAFVKEGKFFEAEELYDE 
MLPRGIIPNTITYSSMIDGFCKQNRXjDAA£HMFYIjKATKGCSPNLITFNT 
L I DG YCGAKR I DDGMELLH EMTETGLVADTTT YNTL I HG F YLVGDLNAAL 

DLLQEMISSGLCPDIVTCDTLLDGLCDNGKiKDALEMFKVMQKSKKDLDA 

SHPFNGVEPDVQTYNILISGLINEGKFLEAEELYEEMPHRGIVPDTITYS 

SMIDGLCKQSRLDEATQMFDSMGSKSFSPNWTFTTLINGYCKAGRVDDG 

LELFCEMGRRGIVANAITYITLICGFRKVGNINGALDIFQEMISSGVYPD 

TITIRNMLTGLWSKEELKRAVAI^EKLQMSMVCKFLFSLCIFYINKNVYI 
LLCVASD 

(SEQ ID NO: 88) 
Gene 26 Product 

atgttggctagggtttgtggattcaagtgttcttcttctcctgctgagtc 

tgcggctagattgttctgtacgagatcgattcgtgatactctggccaagg 

caagcggagagagttgcgaagcaggttttggaggagagagtttgaagctg 

caaagtgggtttcatgaaatcaaaggtttagaggatgcgattgatttgtt 

cagtgacatgcttcgatctcgtcctttaccttctgtggttgatttctgta 

aattgatgggtgtggtggtgagaatggaacgcccggatcttgtgatttct 

ctctatcagaagatggaaaggaaacagattcgatgtgatatatacagctt 

caatattctgataaaatgtttctgcagctgctctaagctcccctttgctt 

tgtctacatttggtaagatcaccaagcttggactccaccctgatgttgtt 

accttcaccaccctgctccatggattatgtgtggaagatagggtttctga 

agccttggatttttttcatcaaatgtttgaaacgacatgtaggcccaatg 

tcgtaaccttcaccactttgatgaacggtctttgccgcgagggtagaatt 

gtcgaagccgtagctctgcttgatcggatgatggaagatggtctccagcc 

tacccagattacttatggaacaatcgtagatgggatgtgtaagaagggag 

atactgtgtctgcactgaatctgctgaggaagatggaggaggtgagccac 

atcatacccaatgttgtaatctatagtgcaatcattgatagcctttgtaa 

agacggacgtcatagcgatgcacaaaatcttttcactgaaatgcaagaga 

aaggaatctttcccgatttatttacctacaacagtatgatagttggtttt 
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tgtagctctggtagatggagcgacgcggagcagttgttgcaagaaatgtt 
agaaaggaagatcagccctgatgttgtaacttataatgctttgatcaatg 
catttgtcaaggaaggcaagtt ctttgaggctgaagaat tatacgatgag 
atgcttccaaggggtataatccctaatacaatcacatatagttcaatgat 
cgatggatt ttgcaaacagaatcgtcttgatgctgctgagcacatgtttt 
atttgatggctaccaagggctgctctcccaacctaatcactttcaatact 
ctcatagacggatattgtggggctaagaggatagatgatggaatggaact 
tctccatgagatgactgaaacaggattagttgctgacacaactacttaca 
acactcttattcacgggttctatctggtgggcgatcttaatgctgctcta 
gaccttttacaagagatgatct ctagtggtt tgtgccctgatatcgttac 
ttgtgacactttgctggatggtctctgcgataatgggaaactaaaagatg 
cattggaaatgtttaaggttatgcagaagagtaagaaggatcttgatgct 
agtcaccccttcaatggtgtggaacctgatgttcaaacttacaatatatt 
gatcagcggcttgatcaatgaagggaagtttttagaggccgaggaattat 
acgaggagatgccccacaggggtatagtcccagatactatcacctatagc 
tcaat gat cgatggatt at gcaagcagagccgcctagatgaggctacaca 
aatgtttgattcgatgggtagcaagagcttctctccaaacgtagtgacct 
ttactacactcattaatggctactgtaaggcaggaagggttgatgatggg 
ctggagcttt tctgcgagatgggtcgaagagggatagttgctaacgcaat 
tacttacatcactttgatttgtggttttcgtaaagtgggtaatattaatg 
gggctctagacattttccaggagatgatttcaagtggtgtgtatcctgat 
accattaccatccgcaatatgctgactggtttatggagtaaagaggaact 
aaaaagggcagtggcaatgcttgagaaactgcagatgagtatggtatgta 
agt ttctgttcagtctatgtattttttatataaacaagaatgtatacatt 
cttttgtgtgtagcttcagattga 

(SEQ ID NO: 89) 
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1. An isolated nuclear fertility restorer nucleic acid selected from the group consisting 

of: 

a) a nucleic acid comprising a gene within positions 88,073 and 198,041 of the 
nucleotide sequence of SEQ ID NO:87; 

b) a nucleic acid encoding a protein comprising an amino acid sequence encoded 
by a gene within positions 88,073 and 198,041 of the nucleotide sequence of SEQ ID 
NO:87; — 

c) a nucleic acid comprising a nucleotide~sequence differing from the sequence 
of the nucleic acids of a) or b) due to the degeneracy of the genetic code; 

d) a nucleic acid fragment of any of a) through c) above at least 150 nucleotides 
in length; and 

e) a nucleic acid having at least 70% homology with a gene within positions 
88,073 and 198,041 of the nucleotide sequence of SEQ ID NO:87, wherein sequence 
homology is determined by the Karlin and Altschul algorithm using standard 
parameters. 

2. The nucleic acid of Claim 1, wherein the nucleic acid comprises a nucleotide 
sequence selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID 
NO:42, SEQ ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. 

3. The nucleic acid of Claim 1, wherein the nucleic acid comprises a nucleotide 
sequence of SEQ ID NO:32. 

4. The nucleic acid of Claim 1, wherein the nucleic acid comprises a nucleotide 
sequence of SEQ ID NO:89. 

5. The nucleic acid of Claim 1, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, 
SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. 

6. The nucleic acid of Claim 1, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ED NO:3 1 . 
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7. The nucleic acid of Claim 1, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:89. 

8. The nucleic acid of Claim 1, wherein the nucleic acid comprises a nucleotide 
sequence differing from the sequence of the nucleic acids of a) or b) of Claim 1 due to 
the degeneracy of the genetic code. 

9. The nucleic acid of Claim 1, wherein the nucleic acid is at least 150 nucleotides in 
length and is a fragment of any of the nucleic acids of a) through c) of Claim 1. 



10. The nucleic acid of Claim 9, wherein the nucleic acid encodes a protein having a 
pentatricopeptide motif. 

1 1 . The nucleic acid of Claim 1 , wherein the nucleic acid has at least 70% homology with 
the nucleotide sequence selected from SEQ ID NO: 30, SEQ ID NO:32, SEQ ID 
NO:34, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID 
NO: 89 and wherein sequence homology is determined by the Karlin and Altschul 
algorithm using standard parameters. 

12. The nucleic acid of Claim 1, wherein the nucleic acid encodes a protein having a 
pentatricopeptide motif 

* > 

13. A vector comprising a nucleic acid of Claim 1. 

14. A first isolated nucleic acid that hybridizes under highly stringent conditions to a 
second nucleic acid selected from the group consisting of: 

a) a second nucleic acid comprising a nucleotide sequence selected from SEQ ID 
NO:30, SEQ ID NO:32 5 SEQ ID NO:34, SEQ ID NO:42, SEQ ID NO:44, SEQ ID 
NO:52, SEQ ID NO:54 and SEQ ID NO:89; and 

b) a second nucleic acid encoding a protein comprising an amino acid sequence 
selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:41, SEQ 
ID NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88, 

wherein the first nucleic acid encodes a nuclear fertility restorer protein. 
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16. A transgenic plant comprising a plant cell of Claim 15. 

1 7. The transgenic plant of Claim i6, wherein the plant is a hybrid. 

18. The transgenic plant of Claim 16 or 17, wherein the plant is a Brassica napus plant. 

19. A seed produced by the plant of Claim 18. 



20. The seed of Claim 19, wherein the nucleic acid comprises a nucleotide sequence 
selected from SEQ ID NO:30, SEQ ID NO:32 ? SEQ ED NO:34, SEQ ID NO:42, SEQ 
ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO: 89. 

2 1 . The seed of Claim 1 9, wherein the nucleic acid comprises a nucleotide sequence of 
SEQ ID NO:32. 

22. The seed of Claim 19, wherein the nucleic acid comprises a nucleotide sequence of 
SEQ ID NO:89. 

23. The seed of Claim 19, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence selected from an SEQ ID NO:29, SEQ ID NO:31, SEQ ED 
NO:33, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID 
NO:88. 

24. The seed of Claim 19, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:3 1 . 

25. The seed of Claim 19, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:88. 

4 

26. A method of producing a hybrid plant, comprising crossing a male-sterile plant with a 
male-fertile plant, collecting hybrid seed from the male-sterile plant and regenerating 
the hybrid plant from the seed, wherein the male-fertile plant comprises a nuclear 
fertility restorer nucleic acid selected from the group consisting of: 
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a) a nucleic acid comprising a gene within positions 88,073 and 198,041 of the 
nucleotide sequence of SEQ ID NO: 87; 

b) a nucleic acid encoding a protein comprising an amino acid sequence encoded 
by a gene within positions 88,073 and 198,041 of the nucleotide sequence of SEQ ID 
NO:87; 

c) a nucleic acid comprising a nucleotide sequence differing from the sequence 
of the nucleic acids of a) or b) due to the degeneracy of the genetic code; 

d) a nucleic acid fragment of any of a) through c) above at least 150 nucleotides 
in length; and 

e) „ a nucleic acid having at least 70%- homology with a gene within positions- 

88,073 and 198,041 of the nucleotide sequence of SEQ ID NO:87, wherein sequence 
homology is determined by the Karlin and Altschul algorithm using standard 
parameters. 

27. The method of Claim 26, wherein the nucleic acid comprises a nucleotide sequence 
selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ 
ID NO:44, SEQ ED NO:52, SEQ ID NO:54 and SEQ ED NO:89. 

28. The method of Claim 26, wherein the nucleic acid comprises a nucleotide sequence of 
SEQ ID NO:32. 

29. The method of Claim 26, wherein the nucleic acid comprises a nucleotide sequence of 
SEQIDNO:89. 

30. The method jyf Claim 26, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, 
SEQ ID NO:41, SEQ ID NO:43, SEQ ED NO:51, SEQ ID NO:53 and SEQ ID NO:88. 

31. The method of Claim 26, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:3 1 . 

32. The method of Claim 26, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:89. 
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33. The method of Claim 26, wherein the nucleic acid comprises a nucleotide sequence 
differing from the sequence of the nucleic acids of a) or b) of Claim 1 due to the 
degeneracy of the genetic code. 

34. The method of Claim 26, wherein the nucleic acid is at least 150 nucleotides in length 
and is a fragment of any of the nucleic acids of a) through c) of Claim 1. 

35. The method of Claim 26, wherein the nucleic acid has at least 70% homology with 
the nucleotide sequence selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:34, SEQ ID NO:42, SEQ-ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID 
NO: 89 and wherein sequence homology is determined by the Karlin and AJtschul 
algorithm using standard parameters. 

36. The method of Claim 26, wherein the hybrid plant is a Brassica napus plant. 

37. The method of Claim 26, wherein the male-sterile plant comprises a cytoplasmic male 
sterility determinant selected from the group consisting of pol, nap, Tournefortii, 
Kosena and ogu. 

38. The method of Claim 37, wherein the cytoplasmic male sterility determinant is ogu. 

39. A method of increasing production of viable pollen in a plant, comprising introducing 
a nuclear fertility restorer nucleic acid into the plant, wherein the nuclear fertility 
restorer nucleic acid selected from the group consisting of: 

_a^ a nucleic acid comprising a gene within positions 88,073 and 198,041 of the 
nucleotide sequence of SEQ ID NO:87; 

b) a nucleic acid encoding a protein comprising an amino acid sequence encoded 
by a gene within positions 88,073 and 198,041 of the nucleotide sequence of SEQ ID 
NO:87; 

c) a nucleic acid comprising a nucleotide sequence differing from the sequence 
of the nucleic acids of a) or b) due to the degeneracy of the genetic code; 

d) a nucleic acid fragment of any of a) through c) above at least 150 nucleotides 
in length; and 

e) a nucleic acid having at least 70% homology with a gene within positions 
88,073 and 198,041 of the nucleotide sequence of SEQ ID NO:87, wherein sequence 
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homology is determined by the Karlin and Altschul algorithm using standard 
parameters. 

40. The method of Claim 39, wherein the plant comprises a cytoplasmic male-sterility 
determinant. 

41. The method of Claim 40, wherein the plant comprises an ogii cytoplasmic male 
sterility determinant. 

42. The method of Claim 39, wherein the plant is a Brassica napas plant. 

43. The method of Claim 39, wherein the nucleic acid comprises a nucleotide sequence 
selected from SEQ ED NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ 
ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. 

44. The method of Claim 39, wherein the nucleic acid comprises a nucleotide sequence of 
SEQIDNO:32. 

45. The method of Claim 39, wherein the nucleic acid comprises a nucleotide sequence of 
SEQ ID NO: 89. 

46. The method of Claim 39, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, 
SEQ ID NO:41, SEQ ED NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. 

47. The method of Claim 39, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:31. 

48. The method of Claim 39, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:88. 

49. The method of Claim 39, wherein the nucleic acid comprises a nucleotide sequence 
differing from the sequence of the nucleic acids of a) or b) of Claim 1 due to the 
degeneracy of the genetic code. 
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50. The method of Claim 39, wherein the nucleic acid is at least 150 nucleotides in length 
and is a fragment of any of the nucleic acids of a) through c) of Claim 1. 



51. The method of Claim 39, wherein the nucleic acid has at least 70% homology with a 
nucleotide sequence selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, 
SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89 
and wherein sequence homology is determined by the Karlin and Altschul algorithm 
using standard parameters. 
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Fig. 9 Gene 16 cDNA and Protein 
10 20 30 40 50 60 70 

I I I I I I I I I I I I I 

atgaggattgatgtttctgagccagagctatgcggttgcgacacttgtgtccagcatcggactttcatta 

MRIDVS E PELCG'CDTCVQHRTF I 

5 10 15 20 

80 90 100 110 120 130 140 



ctcaagaaaccgagccgagcaaagaagtgat tggctcatcggttcctgttagttccgaaccagttcaacc 
TQETEPSKEVIGSSVPVSSEPVQP 
25 30 35 40 45 

150 160 170 180 190 200 210 

| |-..-| | | | I | | | | | | | 

tcttggttccacctcagatgagagttcaggaacagagacgactccactcgctcctcctccagtcaccaca 
L G S T S D — E S SGTETTPLAP P P V T T _ 
50 55 60 65 7 0" 





220 


230 




240 


250 


260 




270 




280 


....I. 


...|....| 


|....|... 


-I- 


...|.... 


|....|... 


..|....|... 


•1- 


...|... 


•I- 


...| 



ccggttaataatcctgaaccagcagcgcagtctgttggctcaaccatcccacctgctgttacaccagtta 
PVNNPEPAAQSVGSTI PPAVTPV- 

75 80 85 90 

290 300 310 320 330 340 350 

gttccgaacaaccagcacaagctcttggttccacctcggat^aaagttccggtacagagaccactccact 
SSEQPAQALGSTSDQSSGTETTPL 
95 100 105 110 115 

360 370 380 390 400 410 420 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 
cgctcctcctatcaccacgtcggttaagtctgttgactcgaccatcttcttcaagttcccaccggtacaa 
APP ITTSVKSVDSTI FFKFPPVQ 
120 125 130 135 140 

430 440 450 460 470 480 490 

,...|....|....|....|....|....|....|....|....|....|....|....|....|....| : 

gcacaagctcttgcccctactgcttccggttcaacgcaagcccctgcttttggttttggtgcattcgctg 
AQALAPTASGSTQAPAFGFGAFA 

145 150 155 160 

500 510 520 530 540 550 560 

...•|....|....|....|....|....|....|....|....|....|....|....|....|....| 
ctcgcgtaccatctgccacctccggttgttcagcatttagtttcgcccctcctgttacatcggcaccagt 
ARVPSA-TSGCSAFSFAPPVTSAPV 
165 . 170 175 180 185 

570 580 590 600 610 620 630 

....|....|....|....|....|....|....|....|....f....|....|....|....|....l 

gcaagctctaggcacaaccactactactactactactacatccgcggccgctcctgcatctccatttcac 
QALGTTTTTTTTTSAAAPAS PFH 

190 195 200 205 210 

640 650 660 670 680 690 700 

....|....|....|....|....|....|....|....|....|....|....|....|....|.: ..| 

agttcctcaccaaccacattccaattccctcctgcttttacatcccttgctgcttctacttttccttctg 
SS SPTTFQF P PAFT S LAASTF P S 

215 220 225 230 

710 720 730 740 750 760 770 

....|....|....|....|....|....|....|....|....|.,..|....|....|....|....| 
t tgca teat caactt cat ctccacttgatgctcctccct caeca tttagatggggatcactgcaagctaa 
VASSTSSPLDAPPSPFRWGSLQAN 
235 240 245 250 255 
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Fig. 9 Continued 
780 790 800 810 820 830 840 

cacttccccaccctttagcttcttgccagcgcaaggttctgacaagactggttctgcttttactccaccg 
TSPP FS FLPAQGSDKTGSAFTPP 

260 265 270 275 280 

850 860 870 880 890 900 910 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

tttggctaccctggtggttttgccagacctgatgttggtgtctctcatccagggtttggtccctctaacc 
FGYPGGFARPDVGVSHPGFGPSN 

285 290 295 300 

920 930 940 950 960 970 980 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

attttggaccaaacgcaccaactactacacctgttcctgttcgcagtccatttttggctggtggtggaac 
H F G P N A — P TTTPVPVRS PFLAGGGT 

305 310 315 320 325 

990 1000 1010 1020 1030 1040 1050 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

tgaacaaggtagtaggtatcctcgttattcacctacaccagatgttgacggcaggctgataatgtccata 
EQGSRYPRYS PTPDVDGRLIMS I 

330 335 340 345 350 

1060 1070 1080 1090 1100 1110 1120 

.-..|....|....|....|....|....|....|....|....|....|....|....|....|....| 

tctgcttccaactcacatggacataaaagtcatgaagaattgaggtgggaagattacaaaaatggagaca 
SASNSHGHKS HEELRWEDYKNGD 

355 360 365 370 

1130 1140 1150 1160 1170 1180 1190 

aaggtgggtttgggtggtttcctcctgttcatacatctcccttttcctcaccaacggtatcaccgtcgct 

KG-GFGWFPPVHTSPFSSPTVSPSL 
375 380 385 390 395 

1200 1210 1220 1230 1240 1250 1260 

.••.|.-...|...-|....|....|....|....|....|..-.|...-|....|....|.-..|....| . 
atttgctcctccaagcatacctaatcgtcctcagatgagaactattgatctaacgaaccgagacatgtgt 
FAPPS I PNRP Q MRTIDLTNRDMC 

400 405 410 415 420 

1270 1280 1290 1300 1310 1320 1330 

••••I i — i — i — i — i — i — | — | — | — i .... i — i — i 

ggttttcctattggctacaacacccccgctgctttccagagaccccctgaacccgctggtgtttcttccc 
GFP I G — Y N T PAA FQR p PE P AGV S S 

425 430 435 440 

1340 1350 1360 1370 1380 1390 1400 

— I — j — | — | — | — | — | — | — | .... | .... | — | — | — | 

cagcatctggatgcacagcgtgtggagccacgagtaggtcctctccttctagtcacttgggcttgaacaa 
PAS GCTA CGAT S RS S P S S HLG LNN 
445 450 455 460 465 

1410 1420 1430 1440 1450 1460 1470 

taccacaaatcctccatcagctgcgacatctcttcccgggatgttcttttctacctatggttcttgtcct 
TTNPPSAAT S LPGMF. FSTYGSCP 

470 475 480 485 490 

1480 1490 1500 1510 1520 1530 1540 

•-•.|..-.|....|....|-..-|...-|....|....|....|....|....|....|....|....| 

ttgctgtttggctcaccaaatcttgcaacttatggtacaacagcaattccagcagtccaagcctatgcta 
LLFGSPNLATYGTTA IPAVQAYA 

495 500 505 510 
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Fig. 9 Continued 

1550 1560 1570 1580 1590 1600 1610 

,...|....|....|....|....|....|....|....|....|....|.-..|....|..-.|....| 
ttatgtttggggctccaaattttacttctcaaggtacaacggcaactccagcttttcaagcctttcctat 
IMFGAPNFTSQGTTATPAFQAFPI 
515 520 525 530 535 

1620 1630 1640 1650 1660 1670 1680 

....|....|....|....|.,..|....|....|....|....|....|....|....|....|...,| 
tatgtttgggactccaaatcttgctgctcaaggtactacaagagctccagctgttcaagcctatcctacg 
M FGT PN LA'AQGTT RA PAVQAY P T 

540 545 550 555 560 

1690 1700 1710 1720 1730 1740 1750 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 
atgtttggcacgccaaatattggagttcaaggttcaactccagcagctcaaacctatcctttgatgtttg 
MFGTPNIGVQGSTPA-AQTY_ P_-L M F 

565 570 575 580 

1760 1770 1780 1790 1800 1810 1820 

....|....|....|....|....|....|....|....|....|....|....|....|...-|.^.| 
gcaccccaaatcttgctgctcaaggtacaacaaatattggagctcgaggtacaactccagcagctcaagc 
GTPN LAAQ GTTNI GARGTT PAAQA 
585 590 595 600 605 

1830 1840 1850 1860 1870 1880 1890 

. ... | .... | .... | .... | .... | . . . • | • ... | .... | .... | . . . . | .... | .... | .... | .... | 
ctatccgttgatgtttggcaccccaaatcttgctgctcaaggtacaacaactccagcagttcagtcctat 
YPLMFGTPNLAAQGTTTPAVQS Y 

610 615 620 625 630 
1900 1910 1920 1930 1940 1950 1960 
| | | | | | | | | | | | | | 

cctacgatgtttggaacaccaaatctagctggtcaaagtacaacaacaactcgagcaggtcagccatatc 
PTMFGTPNIjAGQSTTTT ragqpy 

635 640 645 650 

1970 1980 1990 2000 2010 2020 2030 

....|....|....|....|....|....|.....|....|....|....|....|....|....|....| 

ctacgacgtttgctgttcctcaagctgcgacagctccagcagttcagccgtatgctatgatgtttggtac 
P T T FAV PQAATA P A V Q P YAMM FG T 
655 660 665 670 675 

2040 2050 2060 2070 2080 2090 2100 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

accaagtctcggagctcaagatatcactccaggaggtcaagcctatcccgctcatggtttaactctccca 
PSLGAQDITPGGQAYPAHGLTLP 

680 685 690 695 —7-00 

2110 2120 



ttcgccgccatgagtctgcagtaa 2124 (SEQ ID NO: 32) 
FAAMSLQ* (SEQ ID NO: 31) 

705 
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Fig. 13 Gene 2 6 cDNA and Peptide 
10 20 30 40 50 60 70 

atgttggctagggtttgtggattcaagtgttcttcttctcctgctgagtctgcggctagattgttctgta 
MLARVCGFKCSSSPAESAARLFC 

5 10 15 20 

80 90 100 110 120 130 140 

....|..^|....)....|....|....|....|....|....|.-..|....|....|.-..|..--| 

cgagatcgattcgtgatactctggccaaggcaagcggagagagttgcgaagcaggttttggaggagagag 
TRS IRDTLAKASGESCEAGFGGE S 
25 30 35 40 45 

150 160 170 180 190 200 210 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

tttgaagctgcaaagtgggtttcatgaaatcaaaggtttagaggatgcgattgatttgttcagtgacatg 

L K L "Q S GFHE I K G L E D— AIDLFSDM 

50 5.5 60 65 70 

220 230 240 250 260 270 280 

....|....|....|....|....|....|....|-...|....|....|..-.|....|....|....| 
cttcgatctcgtcctttaccttctgtggttgatttctgtaaattgatgggtgtggtggtgagaatggaac 
LRSRPL P S V VDF C KLMGVV VRME 

75 80 85 90 

290 300 310 320 330 340 350 

gcccggatcttgtgatttctctctatcagaagatggaaaggaaacagattcgatgtgatatatacagctt 
RPDLVI SLYQKMERKQ I RCDIYS F 
95 100 105 110 115 

360 370 380 390 400 410 420 

caatattctgataaaatgtttctgcagctgctctaagctcccctttgctttgtctacatttggtaagatc 
NIIilKCFCSCSKLPFALSTFGKI 

120 125 130 135 140 

430 440 450 460 470 480 490 

— I — i — i — i — I - • • -i — t — ! — I — I — i .... i .... i — i 

accaagcttggactccaccctgatgttgttaccttcaccaccctgctccatggattatgtgtggaagata 
TKLGLHPDVVTFTTLLHGLCVED 

145 150 155 160 

500 510 520 530 540 550 560 

gggtttctgaagccttggatttttttcatcaaatgtttgaaacgacatgtaggcccaatgtcgtaacctt 
RVS EALDFFHQMFE T — T C R P N V V T F 
165 170 175 180 185 

570 580 590 600 610 620 630 

.... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | 
caccactttgatgaacggtctttgccgcgagggtagaattgtcgaagccgtagctctgcttgatcggatg 
TTLMNGLCREGRIVEAVALLDRM 

190 195 200 205 210 

640 650 660 670 680 690 700 

at 99 a agatggtctccagcctacccagattacttatggaacaatcgtagatgggatgtgtaagaagggag 
MEDGLQPTQ I TYGT I VDGMCK KG 

215 220 225 230 

710 720 730 740 750 760 770 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 
atactgtgtctgcactgaatctgctgaggaagatggaggaggtgagccacatcatacccaatgttgtaat 
DTVSALNLiliRKMEEVSHI IPNVVI 
235 240 245 250 255 
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Fig. 13 Continued 
780 790 800 810 820 830 840 

| | | | | | | | | | | | ) | 

ctatagtgcaatcattgatagcctttgtaaagacggacgtcatagcgatgcacaaaatcttttcactgaa 
YSAI I DSL CKDGRHS DAQNLFTE 

260 265 > 270 275 , 280 

850 860 870 880 890 900 910 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

atgcaagagaaaggaatctttcccgatttatttacctacaacagtatgatagttggtttttgtagctctg 
MQ EKG I F PDL FTYNSM IVGFCSS 

285 290 295 300 

920 930 940 950 960 970 980 

gtagatggagcgacgcggagcagttgttgcaagaaatgttagaaaggaagatcagccctgatgttgtaac 
G R — W— SDAEQLLQEM-LER KI SPDVVT 
305 310 315 320 - 325 

990 1000 1010 1020 1030 1040 1050 

ttataatgctttgatcaatgcatttgtcaaggaaggcaagttctttgaggctgaagaattatacgatgag 
YNALINAFVKEGKFFEAEELYDE 

330 335 340 345 350 

1060 1070 1080 1090 1100 1110 1120 

.....|....|....|....|....|....|....|....|....|....|....|....|.--.|....| 

atgcttccaaggggtataatccctaatacaatcacatatagttcaatgatcgatggattttgcaaacaga 
MLPRGI I PNTITYSSMIDGFCKQ 

355 360 365 370 

1130 1140 1150 1160 1170 1180 1190 

atcgtcttgatgctgctgagcacatgttttatttgatggctaccaagggctgctctcccaacctaatcac 
NRLDAAEHMFYLMATKGCSPNLIT 
375 380 385 390 395 

1200 1210 1220 1230 1240 1250 1260 

....|.-..|....|....|....|....|....|....|....|....|...,|....|....|....| 

tttcaatactctcatagacggatattgtggggctaagaggatagatgatggaatggaacttctccatgag 
FNTL I DG Y CGAKR I DDGMELLHE 

400 405 410 415 420 

1270 1280 1290 1300 1310 1320 1330 

...-|....!....|....|....|....|....|....|....|....|....|....|....|....| 

atgactgaaacaggattagttgctgacacaactacttacaacactcttattcacgggttctatctggtgg 
MTETGLVADTTT Y— N T Ij ihgfylv 

425 430 435 440 

1340 1350 1360 1370 1380 1390 1400 

I — I I I — I ! | | | I 1 — I 1 I 

gcgatcttaatgctgctctagaccttttacaagagatgatctctagtggtttgtgccctgatatcgttac 
GDLN AALD LLQEM I S S GL.CPD IVT 
445 450 455 460 465 

1410 1420 1430 1440 1450 1460 1470 

ttgtgacactttgctggatggtctctgcgataatgggaaactaaaagatgcattggaaatgtttaaggtt 
CDTLLDGL CDNG KLKDALEM F KV 

470 475 480 485 490 

1480 1490 1500 1510 1520 1530 1540 

.-..|....|....|....|....|....|....|....|....|....|....|....|...,|....| 

atgcagaagagtaagaaggatcttgatgctagtcaccccttcaatggtgtggaacctgatgttcaaactt 
MQKSKKDLDASHPFNGVEPDVQT 

495 500 505 510 
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Fig. 13 Continued 
1550 1560 1570 1580 1590 1600 1610 

acaatatattgatcagcggcttgatcaatgaagggaagtttttagaggccgaggaattatacgaggagat 
YNILI SGLINEGKFLEAEELYEEM 
515 520 * 525 530 535 

1620 1630 1640 1650 1660 1670 1680 

gccccacaggggtatagtcccagatactatcacctatagctcaatgatcgatggattatgcaagcagagc 
PHRGIV PDT ITYS SM IDGLCKQS 

540 545 550 555 560 

1690 1700 1710 1720 1730 1740 1750 

cgcctagatgaggctacacaaatgtttgattcgatgggtagcaagagcttctctccaaacgtagtgacct 
RLDEATQMFDSMGSKSFSPN VVT 

565 570 575 580 

1760 1770 1780 1790 1800 1810 1820 

ttactacactcattaatggctactgtaaggcaggaagggttgatgatgggctggagcttttctgcgagat 
FTTLINGYCKAGRVDDGLELFCEM 

585 590 595 600 605 

1830 1840 1850 1860 1870 1880 1890 

gggtcgaagagggatagttgctaacgcaattacttacatcactttgatttgtggttttcgtaaagtgggt 
GRRGIVANAITYITL I CGFR KVG 
610 615 620 625 630 

1900 1910 1920 1930 1940 1950 1960 

| | | | ) | I I I I I I I I 

aatattaatggggctctagacattttccaggagatgatttcaagtggtgtgtatcctgataccattacca 

NINGALD I FQEMI S SGVYPDT IT 

635 640 645 650 

1970 1980 1990 2000 2010 2020 2030 

— I — | — | — I ... - 1 — I — I — I — I — I — I • • •.. I — I — I 

tccgcaatatgctgactggtttatggagtaaagaggaactaaaaagggcagtggcaatgcttgagaaact 
IRNMLTGLWS KEELKRAVAMLEKL 

655 660 665 670 675 

2040 2050 2060 2070 2080 2090 2100 

— I — I — I — I — I — I — I — I — I — i — I — I — i — l 

gcagatgagtatggtatgtaagtttctgttcagtctatgtattttttatataaacaagaatgtatacatt 
Q M~ SMVCKFLFSLC I FYIN. KNVYI 

680 685 690 695 700 

2110 2120 

....|....|....|....|.... 

cttttgtgtgtagcttcagattga 2124 (SEQ ID NO: 89) 
LLCVASD* (SEQ ID NO : 88) 

70S 
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